t 



t 





1 


ATGGATTTTC 




51 


GGCCGCCGAG 




101 


GCCTATGTCG 




151 


CAGTCTTGGG 




201 


CAGGATAAGG 




251 


GTGAATGTAT 




301 


AAAACCTGTA 




351 


TGAACAGCCT 




401 


CAAGGGATCT 




451 


TGCATGAACA 




501 


GCTCATGCCG 




551 


ACTGTCAGTA 




601 


CCATCCCCTG 




651 


TGATGAATGT 




701 


TCAACACTTT 




751 


ATGTATATTG 




801 


TGGTCAGTAT 




851 


CCTACAAGTG 




901 


GTGTATATCC 




951 


AAAGGGAAAT 




1001 


TTCCTGATGT 


Q 


1051 


CCTATCATTA 


':[} 


1101 


AAAGCCAACA 




1151 


AGCTCAGAAC 




1201 


CTGACAACTA 


M 


1251 


TGACAACAGG 




1301 


TTCCACGGCA 


halt 


1351 


GGAGTCAGTG 


iTl 


1401 


CAGTTGTAAT 




1451 


ATGACTTGCA 




1501 


ACAGTGTCGG 


Vi 


1551 


ACCTCTCGGC 




1601 


ACAAGGTGAC 




1651 


CACGGTGCCC 


1701 


GAGGCAAACA 




1751 


TCAAAGGTGA 




1801 


GTGAGCTTGA 



TCCTGGCGCT 
TTCGACGGGA 
TTATGGTGGG 
GACAGTGTCA 
TGCCAGCTCA 
CGGGCCAAAC 
ATCAAGACGA 
CTTTTCCAAC 
AAATGAGTGT 
CTTACGGCAG 
GATGGTTCCT 
TGGCTGTGAT 
GCCTGCAGCT 
GCTACAGGAA 
TGGGAGCTAC 
GAGGCAAATA 
CAGTGCAGCA 
CAAATGTAAA 
CAAAAGTTAT 
GGTACCATTT 
TGGAAGTACT 
CCAACAGGCC 
CCAATTCCTA 
ACCTCTACCA 
TAGCACCAGC 
GTACAGACAG 
ACCTTCAAAT 
C AG AC GAT G A 
TTTGACCATG 
CTGGGAACCA 
CAGCCAAAGC 
CGCCTCATGC 
GGGGCTGCAC 
ACGGAGCAGC 
CAGATCACCT 
AAAAAGGCGT 
AAAAAGGCCA 



GGTGCTGGTA 
GGTGGCCCAG 
AGGATTGACT 
GCCTTTCTAC 
AAGCTGTGTG 
AAGTGCAAGT 
GCACATCCCA 
CCCTGGATCA 
GGCCTGAAGC 
CTACAAGTGC 
GCTCAAGTGC 
GTTGTTAAAG 
GGCTCCTGAT 
GAGCCTCCTG 
ATCTGCAAGT 
TCAATGTCAT 
GCTTTGCTCG 
GAAGGATACC 
GATTGAACCT 
TAAAGGGTGA 
TGGTGGCCTC 
TACTTCTAAG 
CTCCACCACC 
CCTACAACCC 
TGCCAGTACA 
ACCCTCAGAA 
GACTTGTTTG 
AGCAAAGGAT 
GACTTTGTGG 
ATCAGGGACC 
CCCAGGGGGA 
ATTCAGGGGA 
TCTGGCACAC 
CCTGTGGGGA 
TGCGAGGGGC 
GGTCACACTG 
CTGCTCTGAA 



TCCTCGCTCT 
GCAAATAGTG 
GCTGCTGGGG 
GTCTTAAGGC 
CCAACCACGA 
GTCATCCTGG 
GCTCCTCTTG 
CCAAGCCACA 
CCCGGCCCTG 
TACTGTCTCA 
CCTGACCTGC 
GACAAATACG 
GGGAGGACCT 
CCCTAGATTT 
GTCATAAAGG 
GACATAGACG 
ATGTTATAAC 
AGGGTGATGG 
TCAGGTCCAA 
CACAGGAAAT 
CGAAGACACC 
CCAACAACAA 
ACCACCACCC 
CAGAAAGGCC 
CCTCCAGGAG 
ACCCAGAGGA 
AAATATTTGA 
GATCCAGGTG 
ATGGATCAGG 
CAGCAGGTGG 
AAAGCTGCAC 
CCTGTGCCTG 
TCCAGGTGTT 
AGAAATGGTG 
TGACATCAAG 
GGGAGATTGG 
GAACGCTAA 



ACCTGCAGGC 
TCATCGATTG 
CTGGGCTCGC 
AGAGAATAGC 
TGCAAACATG 
TTATGCTGGA 
ACCAAGGCAG 
AGTTTGCCTT 
TAAGCACAGG 
ACGGATATAT 
TCCATGGCAA 
GTGCCAGTGC 
GTGTAGATGT 
AGGCAATGTG 
CTTCGATCTC 
AATGCTCACT 
GTACGTGGGT 
ACTGACTTGT 
TTCATGTACC 
AATAATTGGA 
ATATATTCCT 
GACCTACACC 
CTGCCAACAG 
AACCACCGGA 
GGATTACAGT 
GATGTGTTCA 
AATAGAAAGA 
TTCTGGTACA 
GAGAAAGACA 
ACAATATCTG 
GCTTGGTGCT 
TCATTCAGGC 
TGTGAGAAAA 
GCCATGGCTG 
AGCGTCGTCT 
ATTAGATGAT 
(SEQ ID NO:l) 



FEATURES : 

Start Codon: 1 
Stop Codon: 1837 



Homologous proteins : 

Top 10 BLAST Hits: 



CRA| 
CRA| 
CRA| 
CRA| 
CRA| 
CRA| 
CRA| 
CRAj 
.CRAi 
CRA| 



500000 
460001 
840000 
840000 
780002 
163000 
157000 
157000 
100068 
600000 



98943299 /a 
03872918 /a 
42916333 /a 
42916331 /a 
01656899 /a 
000492958 / 
140328010 / 
140333453 / 
2328934 /al 
46724650 /a 



ltid=gi 
ltid=gi 
ltid-gi 
l x tid=gi 
ltid=gi 
altid=g 
alt id=g 
altid=g 
tid=gi I 
ltid=gi 



I 157951 
! 151281 
I 154302 
I 154302 
I 147636 
i | 95065 
i | 67526 
i | 13124 
7512796 
I 139291 



93 /def 
03 /def= 
48 /def 
46 /def: 
63 /def 
63 /def 
58 /def 
888 /de 
/def=p 
80 /def 



ref |NP_277 
gb| AAK84 39 
gb I AAK9601 
gbl AAK9601 
ref |XP_040 
ref |NP_062 
=gb|AAF2781 
f=ref I NP_05 
ir [ IT17324 
=ref|NP 114 



060.11 nep. 
1 . 1 IAF3970. 
1.11 {AY03. 
0.1| (AY03. 
394.11 epi. 
270.11 EGF . 
2.1IAF1860. 
6322.21 ep. 
hypothetic . 
014.11 fib. 



Score 




E 


1190 


0. 


0 


1147 


0. 


0 


1128 


0. 


0 


1084 


0. 


0 


397 


e- 


109 


396 


e- 


109 


396 


e- 


109 


395 


e- 


109 


385 


e- 


106 


147 


4e 


-34 



FIGURE 1A 



! 



Blast hits to dbEST: 



CRA Number 


gi Number 


Score 


Expect 






CRAI 113000119195127 


gi I 14391046 


1068 bits 


(539) 


0. 


0 


CRA | 110000024586745 


gi I 12320908 


866 


bits 


(437) 


0. 


0 


CRAI 87000000737360 


gi I 7312275 


842 


bits 


(425) 


0. 


0 


CRAI 113000119269070 


gi I 14399233 


831 


bits 


(419) 


0. 


0 


CRA| 113000119283425 


gi I 14400823 


827 


bits 


(417) 


0. 


0 


CRA| 113000119283416 


gi | 14400822 


785 


bits 


(396) 


0. 


0 


CRA| 112000057002060 


gi I 14377756 


680 


bits 


(343) 


0. 


0 


CRAI 147000029699826 


gil 11300389 


668 


bits 


(337) 


0. 


0 


CRA| 117000088322182 


gi 1 12462132 


563 


bits 


(284) 


le 


-157 



Expression 
gi Number 



(Tissue 



gi I 14391046 
gi | 12320908 
gi I 7312275 
gi I 14399233 
gi I 14400823 
gi 1 14400822 
gi | 14377756 
gi|11300389 
gi I 12462132 



Source ) 
Organ 



head_neck 

n e r vo us__ t umo r 

head_neck 

head__neck 

head_neck 

head_neck 

colon 

breast_normal 
placenta_normal 



FIGURE IB 



1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 



MDFLLALVLV 
QSWGQCQPFY 
KTCNQDEHIP 
CMNTYGSYKC 
PSPGLQLAPD 
MYIGGKYQCH 
VYIPKVMIEP 
PIITNRPTSK 
LTTIAPAAST 
GVSADDEAKD 
TVSAAKAPGG 
HGAHGAALWG 
VSLKKGHCSE 



SSLYLQAAAE 
VLRQRIARIR 
APLDQGSEQP 
YCLNGYMLMP 
GRTCVDVDEC 
DIDECSLGQY 
SGPIHVPKGN 
PTTRPTPKPT 
PPGGITVDNR 
DPGVLVHSCN 
KAARLVLPLG 
RNGGHGWRQT 
ER (SEQ I 



FDGRWPRQIV 
CQLKAVCQPR 
LFQPLDHQAT 
DGSCSSALTC 
ATGRASCPRF 
QCSSFARCYN 
GTILKGDTGN 
PIPTPPPPPP 
VQTDPQKPRG 
FDHGLCGWIR 
RLMHSGDLCL 
QITLRGADIK 
D NO: 2) 



SSIGLCRYGG 
CKHGECIGPN 
SLPSRDLNEC 
SMANCQYGCD 
RQCVNTFGSY 
VRGSYKCKCK 
NNWIPDVGST 
LPTELRTPLP 
DVFIPRQPSN 
EKDNDLHWEP 
SFRHKVTGLH 
SVVFKGEKRR 



RIDCCWGWAR 
KCKCHPGYAG 
GLKPRPCKHR 
VVKGQIRCQC 
ICKCHKGFDL 
EGYQGDGLTC 
WWPPKTPYIP 
PTTPERPTTG 
DLFEI FEIER 
IRDPAGGQYL 
SGTLQV FVRK 
GHTGEIGLDD 



FEATURES : 

Functional domains and key regions : 

Prosite results: 

PDOC00001 PS00001 ASNJ3LYCOSYLATION 
N-glycosylation site 

320-323 NGTI 



PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 



of 


matches: 10 




1 


157-159 


SYK 


2 


222-224 


TGR 


3 


284-286 


SYK 


4 


354-356 


TNR 


5 


358-360 


TSK 


6 


362-364 


TTR 


7 


366-368 


TPK 


8 


531-533 


SFR 


9 


573-575 


TLR 


10 


602-604 


SLK 



PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 
Number of matches: 3 

1 213-216 TCVD 

2 392-395 TTPE 

3 453-456 SADD 



PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 
246-252 KGFDLMY 



PDOC00008 PS00008 MYRISTYL 
N-myristoylat ion site 
Number of matches: 10 



1 


156-161 


GSYKCY 


2 


172-177 


GSCSSA 


3 


194-199 


GQIRCQ 


4 


238-243 


GSYICK 


5 


268-273 


GQYQCS 


6 


283-288 


GSYKCK 


7 


497-502 


GQYLTV 


8 


509-514 


GGKAAR 


9 


538-543 


GLHSGT 


10 


552-557 


GAHGAA 
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PDOC00010 PSOOOIO ASX_HYDROXYL 

Aspartic acid and asparagine hydroxyla t ion site 
Number of matches: 3 

1 151-162 CMNTYGSYKCYC 

2 233-244 CVNTFGSYICKC 

3 278-289 CYNVRGSYKCKC 



PDOC00016 PS00016 RGD 
Cell attachment sequence 

429-431 RGD 



PDOC00021 PS00022 EGF_1 
EGF-like domain signature 1 

92-103 CKCHPGYAGKTC 



PDOC00021 PS01186 EGF_2 
EGF-like domain signature 2 
Number of matches: 3 

1 92-103 CKCHPGYAGKTC 

2 160-174 CYCLNGYMLMPDGSC 

3 287-300 CKCKEGYQGDGLTC 



PDOC00913 PS01187 EGFJ3A 
Calcium-binding EGF-like 
Number of matches : 3 

1 136-160 

2 216-242 

3 261-287 



domain signature 

DLNECGLKPRPCKHRCMNTYGSYKC 

DVDECATGRASCPRFRQCVNTFGSYIC 

DIDECSLGQYQCSSFARCYNVRGSYKC 



Membrane spanning structure and domains 



Helix 


Begin 


End 


1 


3 


23 


2 


163 


183 


3 


397 


417 



Score Certainity 

0.942 Putative 

0.765 Putative 

1.209 Certain 



SignalP results : 

< Is the sequence a signal peptide? 

# Measure Position Value Cutoff Conclusion 
max. C 20 0.657 0.37 YES 

max. Y 20 0.713 0.34 YES 

max. S 6 0.960 0.88 YES 

mean S 1-19 0.901 0.48 YES 

# Most likely cleavage site between pos . 19 and 20: AAA-EF 
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• # 



BLAST Alignment to Top Hit: 

>CRA|50000093943299 /al tid=gi I .1 57 951 93 /de f-ref I NP_277 0 60 . 1 I 
nephronectin [Mus musculus] /org=Mus musculus 
/taxon=10090 /div=ROD /da taset=n raa /length=609 
Length = 609 

Score - 1190 bits (3045), Expect = 0.0 

Identities - 536/610 (87%), Positives - 569/610 (92%), Gaps - 3/610 (0%) 

Query: 1 MDFLLALVLVSSLYLQAAAEFDGRWPRQIVSSIGLCRYGGRIDCCWGWARQSWGQCQPFY 60 

M LLA VL SSLYLQ AA+FDGRWPRQIVSS IGLCRYGGRI DCCWGWARQSWGQCQPFY 
Sbjct : 1 MAVLLAAVLASSLYLQVAADFDGRWPRQIVSSIGLCRYGGRIDCCWGWARQSWGQCQPFY 60 



Query: 61 VLRQRIARIRCQLKAVCQPRCKHGECIGPNKCKCHPGYAGKTCNQDEHI- PAPLDQGSEQ 119 

VLRQR+ARIRCQLKAVCQP+CKHGEC+GPNKCKCHPG+AGKTCNQDE P PLDQGSEQ 
Sbjct : 61 VLRQRLAR I RCQLKAVCQPQCKHGECVGPNKCKCHPGFAGKTCNQDESFHPT PLDQGSEQ 120 

Query: 120 PLFQPLDHQATSLPSRDLNECGLKPRPCKHRCMNTYGSYKCYCLNGYMLMPDGSCSSALT 17 9 

PLFQP DHQAT++PSRDLNECGLKPRPCKHRCMNT+GSYKCYCLNGYML+ PDGSCSSAL+ 
Sbjct: 121 PLFQPPDHQATNVPSRDLNECGLKPRPCKHRCMNTFGSYKCYCLNGYMLLPDGSCSSALS 180 

Query: 180 CSMANCQYGCDVVKGQIRCQCPSPGLQLAPDGRTCVDVDECATGRASCPRFRQCVNTFGS 239 

CSMANCQYGCDVVKGQ+RCQCPSPGLQLAPDGRTCVD+DECATGR SCPRFRQCVNTFGS 
Sbjct: 181 CSMANCQYGCDVVKGQVRCQCPSPGLQLAPDGRTCVDIDECATGRVSCPRFRQCVNTFGS 240 

Query: 240 YICKCHKGFDLMYIGGKYQCHDIDECSLGQYQCSSFARCYNVRGSYKCKCKEGYQGDGLT 299 

YICKCH GFDLMYIGGKYQCHDIDECSLGQ+QCSS+ARCYN+ GSYKC+C++GY+GDGL 
Sbjct: 24 1 YICKCHTGFDLMYIGGKYQCHDIDECSLGQHQCSSYARCYNIHGSYKCQCRDGYEGDGLN 300 

Query: 300 CVYI PKVMIEPSGPIHVPKGNGTILKGDTGNNNWIPDVGSTWWPPKTPYI PPI TTNRPTS 359 

CVYI PKVMIEPSGPIH+P+ NGTI KGD G+ N IPD GST WP KTPYI PP+ ITNRPTS 
Sbjct: 301 CVYI PKVMIEPSGPIHMPERNGT I SKGDGGHANRIPDAGSTRWPLKTPY I PPV ITNRPTS 360 

Query: 360 KPTTRPTPKPTPIPTPPPPPPLPTELRT-PLPPTTPERPTTGLTTIAPAASTPPGGITVD 418 

KPTTRPTP PTP PTPPPPPPLPTE RT PLPPT PERP+T TTIAPA ST ITVD 
Sbjct: 361 KPTTRPTPNPTPQPTPPPPPPLPTEPRTTPLPPT-PERPSTRPTTIAPATSTTTRVITVD 419 

Query: 419 NRVQTDPQKPRGDVFI PRQPSNDLFEI FEIERGVSADDEAKDDPGVLVHSCNFDHGLCGW 478 

NR+QTDPQKPRGDVFI PRQP+NDLFEI FEIERGVSAD+E KDDPG+L+HSCNFDHGLCGW 
Sbjct: 420 NRIQTDPQKPRGDVFI PRQPTNDLFEI FEIERGVSADEEVKDDPGILIHSCNFDHGLCGW 479 

Query: 479 IREKDNDLHWEPIRDPAGGQYLTVSAAKAPGGKAARLVLPLGRLMHSGDLCLS FRHKVTG 538 

IREKD+DLHWE RDPAGGQYLTVSAAKAPGGKAARLVL LG LMHSGDLCLS FRHKVTG 
Sbjct: 480 I REKDSDLHWETARDPAGGQYLTVSAAKAPGGKAARLVLRLGHLMHSGDLCLS FRHKVTG 539 

Query: 539 LHSGTLQVFVRKHGAHGAALWGRNGGHGWRQTQITLRGADIKSVVFKGEKRRGHTGEIGL 598 

LHSGTLQV FVRKHG HGAALWGRNGGHGWRQTQITLRGAD+KSV+ FKGEKRRGHTGE IGL 
Sbjct: 540 LHSGTLQV FVRKHGTHGAALWGRNGGHGWRQTQITLRGADVKSVI FKGEKRRGHTGEIGL 599 



Query: 599 DDVSLKKGHC 608 
DDVSLK+G C 

Sbjct: 600 DDVSLKRGRC 609 <SEQ ID NO: 4) 
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>CRA| 4 6000103872 918 /altid=gi I 15128103 /de f =gb I AAK84 3 91 . 1 I AF3 97 007_1 
(AF397007) nephronectin [Mus musculus] /org=Mus musculus 
/taxon-10090 /div=ROD /dataset=nraa /length=592 
Length - 592 

Score = 1147 bits (2934), Expect - 0.0 

Identities = 520/610 (85%), Positives - 552/610 (90%), Gaps - 20/610 (3%) 

Query: 1 MDFLLALVLVSSLYLQAAAEFDGRWPRQIVSSIGLCRYGGRIDCCWGWARQSWGQCQPFY 60 

M LLA VL SSLYLQ AA+FDGRWPRQI VSSIGLCRYGGRI DCCWGWARQSWGQCQP 
Sbjct : 1 MAVLLAAVLASSLYLQVAADFDGRWPRQI VSSIGLCRYGGRI DCCWGWARQSWGQCQP-- 58 

Query: 61 VLRQRIARIRCQLKAVCQPRCKHGECIGPNKCKCHPGYAGKTCNQDEHI-PAPLDQGSEQ 119 

VCQP+CKHGEC+GPNKCKCHPG+AGKTCNQDE P PLDQGSEQ 
Sbjct: 59 VCQPQCKHGECVGPNKCKCHPGFAGKTCNQDESFHPT PLDQGSEQ 103 

Query: 120 PLFQPLDHQATSLPSRDLNECGLKPRPCKHRCMNTYGSYKCYCLNGYMLMPDGSCSSALT 179 

PLFQP DHQAT++PSRDLNECGLKPRPCKHRCMNT+GSYKCYCLNGYML+PDGSCSSAL+ 
Sbjct : 104 PLFQP PDHQATNV PS RDLNECGLKPRPCKHRCMNTFGSYKCYCLNGYMLLPDGSCS SALS 163 

Query: 180 CSMANCQYGCDVVKGQIRCQCPSPGLQLAPDGRTCVDVDECATGRASCPRFRQCVNTFGS 239 

CSMANCQYGCDVVKGQ+RCQCPSPGLQLAPDGRTCVD+DECATGR SCPRFRQCVNTFGS 
Sbjct: 164 CSMANCQYGCDVVKGQVRCQCPSPGLQLAPDGRTCVDIDECATGRVSCPRFRQCVNTFGS 223 

Query: 240 YICKCHKGFDLMYIGGKYQCHDIDECSLGQYQCSSFARCYNVRGSYKCKCKEGYQGDGLT 299 

YICKCH GFDLMYIGGKYQCHDIDECSLGQ+QCSS+ARCYN+ GSYKC+C++GY+GDGL 
Sbjct: 224 YICKCHTGFDLMYIGGKYQCHDIDECSLGQHQCSSYARCYNIHGSYKCQCRDGYEGDGLN 283 

Query: 300 CVYIPKVMIEPSGPIHVPKGNGTILKGDTGNNNWIPDVGSTWWPPKTPYI PPI ITNRPTS 359 

CVYIPKVMIEPSGPIH+P+ NGTI KGD G+ N IPD GST WP KTPYI PP+ ITNRPTS 
Sbjct: 284 CVYIPKVMIEPSGPIHMPERNGTISKGDGGHANRIPDAGSTRWPLKTPYI PPVITNRPTS 343 

Query: 360 KPTTRPTPKPTPI PTPPPPPPLPTELRT-PLPPTTPERPTTGLTTIAPAASTPPGGITVD 418 

KPTTRPTP PTP PTPPPPPPLPTE RT PLPPT PERP+T TTIAPA ST ITVD 
Sbjct: 344 KPTTRPT PNPTPQPTPPPPPPLPTEPRTTPLPPT-PERPSTRPTTIAPATSTTTRVITVD 402 

Query: 419 NRVQTDPQKPRGDVFI PRQPSNDLFEI FEIERGVSADDEAKDDPGVLVHSCNFDHGLCGW 478 

NR+QTDPQKPRGDVFI PRQP+NDLFEI FEIERGVSAD+E KDDPG+L+HSCN FDHGLCGW 
Sbjct: 403 NRIQTDPQKPRGDVFI PRQPTNDLFE I FEIERGVSADEEVKDDPGI LI HSCN FDHGLCGW 4 62 

Query: 479 IREKDNDLHWEPIRDPAGGQYLTVSAAKAPGGKAARLVLPLGRLMHSGDLCLSFRHKVTG 538 

IREKD+DLHWE RDPAGGQYLTVSAAKAPGGKAARLVL LG LMHSGDLCLS FRHKVTG 
Sbjct: 4 63 I REKDSDLHWETARDPAGGQYLTVSAAKAPGGKAARLVLRLGH LMHSGDLCLS FRHKVTG 522 

Query: 539 LHSGTLQV FVRKHGAHGAALWGRNGGHGWRQTQITLRGADIKSVVFKGEKRRGHTGEIGL 598 

LHSGTLQVFVRKHG HGAALWGRNGGHGWRQTQITLRGAD+KSV+ FKGEKRRGHTGEIGL 
Sbjct: 523 LHSGTLQVFVRKHGTHGAALWGRNGGHGWRQTQITLRGADVKS VI FKGEKRRGHTGEIGL 582 

Query: 599 DDVSLKKGHC 608 
DDVSLK+G C 

Sbjct: 583 DDVSLKRGRC 592 <SEQ ID NO: 5) 
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>CRA| 84000042916333 /a 1 1 id = g i I 1 54 3024 8 /def =gb I AAK96011 . 1 I 

(AY035899) nephronec t in long isoform [Mus musculus] 
/org=Mus musculus /taxon=10090 /div=R0D /da t aset=nraa 
/length=578 
Length - 578 

Score - 1128 bits (2885), Expect = 0.0 

Identities = 512/609 (84%), Positives = 543/609 (89%), Gaps = 32/609 (5%) 





Query : 


1 


MUr LLALV LV bbLYLQAAAh t DGRWPRQ1 VbblGLCKYGGRI DCCWGWARQoWGQCQr b Y 


b(J 








M LLA VL SSLYLQ AA+ FDGRWPRQI VSS IGLCRYGGRI DCCWGWARQSWGQCQPFY 






Sbjct : 


1 


MAVLLAAVLASSLYLQVAADFDGRWPRQIVSSIGLCRYGGRI DCCWGWARQSWGQCQPFY 


60 




Query : 


bl 


VLRQRI ARIKGQLKAVCQPRCKHGECIGPNKCKCHPGYAGKTCNQDEHI PAPLDQG5EQP 


120 








VLRQR+ARIRCQLKAVCQP+CKHGEC+GPNKCKCHPG+AGKTCNQ 






Sbjct : 


61 


VLRQRLARIRCQLKAVCQPQCKHGECVGPNKCKCHPGFAGKTCNQ 


105 




Query : 


121 


L£ QPLDHQATSLPSRDLNECGLKPRPCKHRCMNTYGSYKCYCLNGYMLMPDGSCSSA 


180 








DLNECGLKPRPCKHRCMNT+GSYKCYCLNGYML+PDGSCSSAL+C 






Sbjct : 


106 


DLNECGLKPRPCKHRCMNTFGSYKCYCLNGYMLLPDGSCSSALSC 


150 


Q 


Query : 


181 


SMANCQYGCDVVKGQIRCQCPS PGLQLAPDGRTCVDVDECATGRASCPRFRQCVNTFGSY 


24 0 








SMANCQYGCDVVKGQ+RCQCPS PGLQLAPDGRTCVD+DECATGR SCPRFRQCVNTFGSY 




Vf 


Sbjct : 


151 


SMANCQYGCDVVKGQVRCQCPSPGLQLAPDGRTCVDIDECATGRVSCPRFRQCVNTFGSY 


210 


'i 

s'rl 


Query : 


241 


ICKCHKGFDLMYIGGKYQCHDI DECSLGQYQCSS FARCYNVRGS YKCKCKEGYQGDGLTC 


300 








ICKCH GFDLMYIGGKYQCHDI DECSLGQ+QCSS+ARCYN+ GSYKC+C++GY+GDGL C 




?n 


Sbjct: 


211 


ICKCHTGFDLMYIGGKYQCHDI DECSLGQHQCSSYARCYNIHGSYKCQCRDGYEGDGLNC 


270 


■••it « 

; i. i 
4 


Query : 


301 


VYI PKVMIEPSGPIHVPKGNGT I LKGDTGNNNWI PDVGSTWWPPKTPYI PPI ITNRPTSK 


360 








VYIPKVMIEPSGPIH+P+ NGTI KGD G+ N IPD GST WP KTPYI PP+ ITNRPTSK 






Sbjct : 


271 


VYIPKVMIEPSGPIHMPERNGTISKGDGGHANRI PDAGSTRWPLKTPYIPPVITNRPTSK 


330 




Query: 


361 


PTTRPTPKPTPI PTPPPPPPLPTELR-TPLPPTTPERPTTGLTTIAPAASTPPGGITVDN 


419 








PTTRPTP PTP PTPPPPPPLPTE R TPLPP TPERP+T TTIAPA ST ITVDN 






Sbjct: 


331 


PTTRPTPNPTPQPTPPPPPPLPTEPRTTPLPP-TPERPSTRPTTIAPATSTTTRVITVDN 


389 




Query : 


420 


RVQTDPQKPRGDVFI PRQPSNDLFEI FEIERGVSADDEAKDDPGVLVHSCNFDHGLCGWI 


479 


[a a a 






R+QTDPQKPRGDVFIPRQP+NDLFEIFEIERGVSAD+E KDDPG+L+HSCNFDHGLCGWI 






Sbjct : 


390 


RIQTDPQKPRGDVFI PRQPTNDLFEIFEIERGVSADEEVKDDPGILIHSCNFDHGLCGWI 


449 




Query : 


480. 


REKDNDLHWEPIRDPAGGQYLTVSAAKAPGGKAARLVLPLGRLMHSGDLCLSFRHKVTGL 


539 








REKD+DLHWE RDPAGGQYLTVSAAKAPGGKAARLVL LG LMHSGDLCLS FRHKVTGL 






Sbjct : 


450 


REKDSDLHWETARDPAGGQY LTVSAAKAPGGKAARLVLRLGHLMHSGDLCLS FRHKVTGL 


509 




Query : 


540 


HSGTLQVFVRKHGAHGAALWGRNGGHGWRQTQITLRGADIKSVV FKGEKRRGHTGEIGLD 


599 








HSGTLQVFVRKHG HGAALWGRNGGHGWRQTQ I TLRGAD+KSV+ FKGEKRRGHTGEIGLD 






Sbjct : 


510 


HSGTLQV FVRKHGTHGAALWGRNGGHGWRQTQITLRGADVKS VI FKGEKRRGHTGEIGLD 


569 




Query : 


600 


DVSLKKGHC 608 










DVSLK+G C 






Sbjct : 


570 


DVSLKRGRC 578 (SEQ ID NO: 6) 
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Hminer search results (Pfam) : 

Scores for sequence family classification (score includes all domains): 
Model Description Score E-value N 



CE00409 
PF00629 
PF00008 
CE00283 
PF01278 



E00409 fibulin 
MAM domain. 
EGF-like domain 
E00283 selectin 
Omptin family 



108.3 
88.2 
68.0 
13.0 
1.9 



3. 8e-30 
1 . 7e-22 
i.7e-17 
0.014 
6.6 



Parsed for domains : 



Model 


Domain 


seq-f 


seq-t 


hmm- f 


hmm-t 


score 


E-value 


PF00008 


1/5 


77 


103 . 


1 


45 [] 


17 


3 


0. 0036 


CE00283 


1/2 


74 


105 . 


119 


153 . . 


10 


7 


0.059 


PF00008 


2/5 


140 


172 . 


1 


42 [. 


19 


4 


0. 00091 


PF00008 


3/5 


180 


214 . 


1 


45 [] 


19 


0 


0. 0012 


CE00409 


1/3 


136 


227 . 


415 


514 . . 


69 


4 


4 . 5e-19 


CE00283 


2/2 


226 


248 . 


124 


146 . . 


8 


7 


0.2 


PF00008 


4/5 


220 


249 . 


1 


41 [. 


10 


8 


0.25 


CE00409 


2/3 


232 


273 . 


429 


473 . . 


35 


2 


2. 3e-09 


CE00409 


3/3 


277 


298 . 


344 


365 . . 


16 


3 


0.00053 


PF00008 


5/5 


265 


300 . 


1 


45 [] 


31 


7 


2. 9e-07 


PF01278 


1/1 


477 


493 . 


241 


257 . . 


1 


9 


6.6 


PF00629 


1/1 


469 


610 . 


1 


170 [] 


88 


2 


1.7e-22 
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1 GAGAAAATTG AGATTACTAC CTGCAAGGTG TCATTACCTG GTAAGAAGCC 
51 TATCAAAAGT TTGTCCTCCT GAAAAAGTAG TTATTGCTAA AAGCTAGCTG 
101 TTTTGATCTC ATTCTTGCTC ATTTGTTTTT AAGACTGAGA TAATGAAATG 
151 TCACTCCCAT GGCAACTCTG CCTCTTTTTC GGAATGATCA TTGGTGGTCA 
201 TAGTTGCAGC ATAATAACCA GTTAGACCTT GGAAATCCTT TAGATTCTCC 
2 51 TTATTCCATG ATTTAACAAA GACTGATATA ATTAGCTACA TTTTACTGAA 
301 GGGAGAAGCT AAAGTTCACA GGCAGAATTC AATTTAATCC AATCCATCTG 
351 TTTNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNCT ATTGGTTAGA 
4 51 CTAATGTCTC ACGGGACCCA GAGTGGTAGG GAGGCAATGA CAACACAATA 
501 CATTAAGAGC TACCTTAGAG CATGCTAAGT GTAATAGAAG TATATAAAAC 
551 ACTAGTCTCA GCCAATCAAA AAGTCAGAAA AGGCTTCCTG GGGGACTGTC 
601 GGCTAAATTT AAACTCAAAG GGTAAGGGGA GATTGTCCAA ATGAAGAAGG 
651 AAGAGGAATG AAGAGGAGGC TAAATTTAAA TGAAGAAGGA GTGTTTTCTA 
7 01 GGCCAAAGCA AATATATGGA AAACAAAAGA AAGACGCATG TAGATGGGGC 
7 51 ATTACACTTT TCCCTCCAGT TATTTATCCT GTTTTCATCC ACCACTCTTC 
801 GTCTTTTCCT TAGATCTCCA GTTTTTTAGC CGTATATTAC CCCCTTTCTC 
851 TCTAATCATC CATTGCACAC AGTGAGGTTT ATTTGTAAAT CTAACCCAGG 
901 CACCAACTAA CCAACCAACA AACAAAAACA GCTAAGGAGT AGCCACTGGA 
951 ACCTGGAACC ATGCCTACAC TTACAAAAAT TTGATTTTCT GCAGAAAAAA 
1001 TATTTTGACT CCTACATTTT TGGACTTCAT TAGAAGGACC TGAAATGGAT 
1051 GACACCAAGC TGTTTGCCTA AAATAATGTC CCAAGCCTGA ATTGGCATGG 
1101 ATCTTTCTTG AGATTAAAAT AGAAACTTGT TTTGCTAACT GAAAACAACT 
1151 TAGAAATCAA AGAGCCATTT AAGTTGAAAC CATTATTTTT CCTTTCCTTG 
1201 AAGAAAATTC CTGTTTTTAC ACACACTGAA TGATCAGGAT AGTGAATCAC 
1251 CCTACCACAG AACTTTCCAT TAAAAATTTG AAGTTGTAGA AACCTCAAAA 
1301 AGAAAATGAA GATGGGGGGA AAACGTTTGT AATGTAGCAA ATGANNNNNN 
1351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
17 51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNAGATG ATCACTGCAC 
1901 AAGGCTGTCA TTCGAGGTAA GGTTGGGAAG AGGGGTGACT TGATTGGGCT 
1951 CATGTACATC TTAGGATAAT CATGTTTTCA TTTGAACACA GTCATTTAGT 
2001 ACAATACTTT TTACTCTGCA ACAACTGAAC AAT GAT TACT TAAAATGGTT 
2051 TCTAAGTGAT CCCCTCAAGA TCAGGCACCA CAACACTACT AGGCAGTAGC 
2101 AGCCCTTCCT CTCTGGCAGG CACTCATCCC TAGAGGGGCA GTCCTGTCCT 
2151 AATGTGCTCG GGAGTTGAGC TGATAGAGAC AGAAACAGTC TGCTCATTTC 
2201 TTCTTCAAAA CCCGACTGGA GAGCCAAATA TTTCTTTTGG TTAGTTACAA 
2251 AATAAAAAAT ATGCCTTTGC CAACTCTGCG GGAAAAGCAA ACTCCCAGAG 
2301 TTTGCCTTCG CTAATTTGTC CAAGTCGTGC TGTGGTTTTA GTAAATGCAG 
2351 ACACTGCTCA GCTCCAGCCC CATAAACCTC TCTGCTCTAG GGCTTCTCCG 
2401 CCCTCCGGTG AAAGCTACTG CTCGCCCCTG CAGTCACCAC CTGTTCGGGC 
2 4 51 GGAACCTGCG GAGCGTGCAC CTACGCCTCG GGCTCCTTTC CTCCTCCACT 
2501 CCCCTTTCCT GCTGGGCACC CTGCTTTCCC TCTCCCAGAG AGGGTTTGCA 
2 551 ACTTTTCTCC CAGGCTGGGG CTCGCCCTGC TTGGCTAACC CCCAAGAGCC 
2 601 ACTGCCGTCC CGCAGCGCCC CTGCCCCCGA GTTGCCTGCC CCGCTGGGCC 
2 651 CCCGGGAGGA GCGGAGCGCG CTCACCCTTC GCCCGGGGCT GGGAGGGCGG 
2701 CGAGTCGGGC GCACGCGCAC CCCCTGCCCG CCCCTGGCGC CCCTCCCCGC 
27 51 GGGCGGTGCA GCTACCCCTG CAGCGCCTCC CCTAGCTAGA AGGGAGCGGG 
2801 AGGGGGCTCC GGGCGCCGCG CAGCAGACCT GCTCCGGCCG CGCGCCTCGC 
2851 CGCTGTCCTC CGGGAGCGGC AGCAGTAGCC CGGGCGGCGA GGGCTGGGGG 
2901 TTCCTCGAGA CTCTCAGAGG GGCGCCTCCC ATCGGCGCCC ACCACCCCAA 
2951 CCTGTTCCTC GCGCGCCACT GCGCTGCGCC CCAGGACCCG CTGCCCAACA 
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3001 TGGATTTTCT CCTGGCGCTG GTGCTGGTAT CCTCGCTCTA CCTGCAGGCG 
3051 GCCGCCGAGT TCGACGGGAG GTGAGCTGGG CCCCGGGGCG CCCTCTCCTC 
3101 CTTCCCGCGC TAATTTCACA CTCACTGTCT TGGGTCACTT TTCCCCGCGG 
3151 GGTTTCGTGG TCAGAGAGGC GTCTCCTCCA TCCAGAAGTT GGGCCACCGC 
32 01 ACAGCGTGGC GCGAGGAGAG CGGTCCAGCG GCTCCGAGTG CCCGCCCGAG 
3251 GCGGAGAGGG CGCGCCCTTG CGAGTCTGGG ACCCCATCCG CGGCCCCCCG 
3301 AGGGCGACTC GCCCCGGCTC GGGAATTAGG ACTGAGGGAG AGGAGCCGCT 
3 351 GGAGCCTGGG ATCTCGGCTC TGAGGGCGCG GTTTAGCCAC CTACGCCGAG 
3 4 01 GTGACGCGCG AAACATCCCT TACCCGGGAA ACTCCCGCGC CTGAACTAGA 
3 4 51 CGGCTCTTCA CTGGGGAAGC TTCCAGGCCC CCGGGGGGAG GCCCGGGCTC 
3501 TGCTCAGGGC TCTCGGGGCC GCTCACACAG AGAGTGGGTG CGAGTCAGCG 
3551 ACTGGGCTAC GGGGGAGATT TGTGGGCCTC TCCATTTGGT TTTCTTGAGG 
3 601 GAAGGAGACT CAAAATGAGG ACCGGAGGGT GGGCGCTCCG TGAATGTGAG 
3 651 CATGAGTGTG TGGATGTGTG TGTGAGAGCG CACACACTGC GCCGCTCCTC 
37 01 AGACTCGGGC GAGCCTGACG GCGGCGTGCT GTGACAGGTT CCAACAACCT 
3751 CGGGCCGCGT CTCCGCTGTC ACTCAGCCGG TCCTCCCGCG CTCCGGGGCC 
3801 GCTCCGGTGT GTGAGAGACA CTGGGTCTGT CGGGAGGGTG TGCTCGGTCC 
3851 CCCTCACCTC TGTGCAATTA CAGACTAGGC TCGTCCCGGG TGCAGATGGC 
3 901 TGCTGCGAAA AGAGGTTTTA CTCTGGCGCA CACCGTCGCC CGGTCGCCTT 

3 951 CCTTCAGCGA CCTCTGCCCC CACCCCTCCG TGTAGGTCCC TGCCTAGTCC 

4 001 TAAAGAAAGA TGCCGCACCT GTTTTACCTT TAATCTTGGA ACGAATCAAT 
4 051 TCCACAATTG ATTCGCTTTC TTGTACCCGG AGGCGAACGG AGGGGAACCA 
4101 GTGGAGCGCC AAGAAAAAGC ACAGCTGCTT GTTCACTACC TGGTCGGAGG 
4151 AAATAGAAAG TTCGGGGTTA TTTTGTACTT TGGGCCTGGG GGTAGAAAGG 
4 201 CAGGTAAAAG AAAAGGGGAA TTGAAAAAGA TAAGGAGACT TTAAGAAAAC 
4 251 GTGATAGCAG CCAGAGTGTA GACCTTTTTA TTTTATTTTT AAAAGCAATT 
4 301 CTGTGCTCAC ATTTGGGTAT GTTATGCATT TCTTGCACAC ATATTTGCAA 
4 351 CAAGAAACCC ATCACAAGAT AT G CAT AT G A TGTGAATGCA TATAGCTTTT 
4 4 01 TGTAACTTTA AAAAGATGTC TAAAACAGCA AATTAAAATG TTGATACCCA 
4 4 51 GAGGAAAAAA GTCATTTAGG TGTGACTCTA AACAAGGAAA CAATTTAGCA 
4 501 AATAATGTGT CAACGTGTAA TCAGGTGGAT AAATCTGTAT CCTGAAATTA 
4 551 CTTTCTTTAG GCATTATTTA CATTAGAGAG GAAAATACTA TGAATTGTTG 
4 601 ATCTAAGCAC GTTTCAAACC ACAAGGATAC TTGAGATATC AGCTACACTT 
4 651 TAAACTCTCC TTGGTTTTAT TTAGACTTTT TTCTAGTACT TTTTTGTTTT 
4 7 01 TTTTGTTTCT GCCACAGTTA TGTCTCAAAA AGAGCTCTGT TACACTTGAT 
4 7 51 TTTTGAGAAA CCTTTCTGCT CCTTCCCCCA TCCACCTTTT ATATGAGGCA 
4 801 GAAATTTTTT CTGCTTTGAC ATGTCTTACT TAATACTTTT CAGTTTATGG 
4 851 TGAATCAGTC AAACCTGGCT TTCACCCCAG TAACAGGTGT GGGTCTTTGA 
4 901 GGAATCTGTT TTATTTCAAG CTTCAGAAAC TATCCTGTGA GTGGCAGCCT 

4 951 TGAGAATTGT TGGTGTTAGC TGGTGCTATT TCTACTTAAG AAAGCATCTT 
5001 TCCCTCCCGT TTTTTCTCCT ACCCTTGTGT GTTGGGGGAT GGGGTATTTA 
5051 ACTGTCTGAA ATTTGACATT AGATCATAGC AGAACTATTT CTGTGAAGGT 
5101 GTTTTAACCT TAAATTTCTG CAGGCATAAA AGAGTTTGTA GAAAATTGTT 
5151 GGGTGTGGTT GACATTTTTT CTGAAGTAAT AATACAGAGG AAATTACCTT 
5201 TCTTCTCAAA AGTGTCCTTA TTTATTTTGA ATCTTTTTTT TTGGTAGGTG 
5251 GCCCAGGCAA ATAGTGTCAT CGATTGGCCT ATGTCGTTAT GGTGGGAGGA 
5301 TTGACTGCTG CTGGGGCTGG GCTCGCCAGT CTTGGGGACA GTGTCAGCGT 
5351 GAGTATCAAG CCTGGGGACT TCAGTTCCCT GGGAGGTGTG GCTTTCCACC 
54 01 TTGTTCATGG CTTCACCCCA CATATCAGAG GGTTCATTAC TGAGCAAGGC 
54 51 TTGGCCTTGC AGGTCTGACT TGGGGATTTT CAGGTACAGT CCAGACTCCT 
5501 TATTCTGCTT CTTTTCAGCT TTAGCCACCT GTATTACGGC CCAGCTTTGT 
5551 CATTCACAGA GAGCCACCTT AAATGTTTCT TTTAGCATTT TTCCCCCCAA 

5 601 ATCTGCATCT CCTTCCTGTT TTCTTACCAG GTTTATTNNN NNNNNNNNNN 
5 651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5701 NNNNNNNNNN NNNNNNNNNN NNNNNNCTGA CTCGCTTGCT CCAGGCTGGG 
57 51 AGATGGGGAT AAGATAATGA GTCTAGACCG TATTTACACT GGGGATAAGA 
5801 TAATGAGTCT AGACCGTATT TACACAAAAT AAACTCTGAA CCTTAGAGTA 
5851 CTTATATAAT CCTCACCATA GCTGTTGTGT AAAGTGGCCC ATTTGTAAAT 
5901 CCTTTTTTGG ACTGGGGTTG TATTGACAGT GTTTTAAATG GACCACCCAG 
5951 TATAATGAAA CAAAGCCACT GAAACAACAT TATTTGTAAG TTCTATAAGT 
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6001 AACCAACCTC ATGTAATAAG TACCAAATCA GAAAGCTGAT TTTCACCTTT 

6051 TCTGTGACCA CAAATGGATA TTTTAATGGT AAAATTTAGA GCTACATCAA 

6101 AAGAGTTTGA GAAATTTAGG AAATTTGTCC CCAGCATTTT ATCGCGATCT 

6151 TAAAATTGTA TCTCACTCCT ACTGCAAAAA AATAGTCTTC AAATGATCAA 

62 01 GTACCTTTCC AGAGCACCCT TAGAGTGCTT GGGGCGGGGG TAGGGGGCAC 

6251 TTCTGGTAAG ATGATGGGAA CTAAGTTGGG TTCTACATTG GGATATATAT 

6301 TTTATTGCTA ATGAGGAGGA GGCTTAGAGG AAGAGAGAAG GGCAGTTACG 

6351 AAGGCTAGAG CTGGCAATGG AGAAGCCTGC CTTAGAGATG GGTTGCTAGT 

64 01 GTGAGGAGTC AGGCAAATTT AAGTTCAGGA AAGTTAGGAG TTCCCTCTGC 

64 51 TATTTTAATT TTTGAGGATG CTTGCAATGT CTTCCTTAAT TTTGTGAAAG 

6501 AGGGACAGTG ACAGTCACAG ATTGACTCTA ATTGCACATA AAGACCACAA 

6551 TCTCTGGTTG GGAATAGAAA GGTAAAGGAA ATGAATGTTT GCCTACCTGG 

6601 TATGGAATTT GAGAACCAAC AGATTCTAAT AACCAAAATG TGAAGAAAGG 

6651 ACCCTTCTGT TGGCCCAACA CACCTACACA TAACCCTCCT GAGTGAAAAA 

67 01 TGAGTAGTTC TATACCTGCA GTCTCCAGCT GTGCAAATAC TTCTGATACT 

67 51 ACAGAAGACT AAATTCCACC AGGCACCATT CTTCTTTTTT GATCATCTTC 

6801 CCTTAAAACA ATATTGAATA GACTAACCAG TGAGTGTACA GCAGCTTTGC 

6851 CTACTTCTTT TGTTTACTGG AAACTGGAGT TACCACCATC TCCCTTTAAC 

6901 AGAATGTAAT TGACCCCCCG TGCAAAGGGT CTAGCCAAGC AGCTCGCACA 

6951 TAGCAGTTAG GCAAACATTC GTTTGCTTCA CCTTTTATGT TTATACTTCC 

7001 AAAATCAAAG TAGTTCCAAG GTTTTCTATG TTACAGTGAA ATAAAATCCC 

7051 TTTCAATTAA AAAGGCACAA ATGGTTCTTT ACTATTAACA TTGAAGTTGG 

7101 TGAGGTTGTA AAAATTAGCT CAAAGGTGAA TGTTTCCTTC TGTGTATTTT 

7151 ATTTTTCCAG CATCTTAGAT GGCGCAAATG TCTCTTGTCA GTTCAGAGTT 

7201 CTGCCTGTCT TTGTTTTGAT ATAAGCAGGT AGAGGAATGT GGGGCTGAGA 

7251 AGTAAGCTTG AAGGGGCAGA ACAAACCAAA AAGAGGCTGA TCAGATTGAA 

7 301 TGAAATATCT CTGAAAACTC TTGATTATTT TAAAGAAAGT CTTTATGAAA 

7 351 TTAAAAGTTT TGTCACTACG TTTGTTCAAG AAAATGCCTT GCTATTGTAT 

7 4 01 AACAATTCAA TCTAATATGA TTCCTTATAA AGATTCCAAA GAACTTCTAG 

7 4 51 CGAATTTAAT GTGAGAAATG TTTTTGCCTT TTCGACCTTT AGATAATCAT 

7 501 GTAGTTCTTT CCCATAAGGA AGGGCTATTC TCCCTTCCTC ATCAGAGGTG 

7551 CTCTGGTTCT CTTCTTTTCT GAATGATTCA CTTTGGAATT TTCCTTCAAA 

7 601 ACAGCATAGC AAAACAAAAA GAAACTATTC CCATTACTGC ATAGATCTTC 

7 651 CCAACTTATC CCATAGAAGG TGTGTTTGTA GGTAGGGAAA GGTGGTGGTA 

7 7 01 CCCTCATTAT ATTATTTAAC AGACCTTATG CCTTAGTGCA GTGACCCTTT 

7 7 51 GGTGAGTGTT TATTCCGTTT GGTTGAGCAT TTGTGAGATG TTTGACTTAA 

7 801 TATTCATGTG AGTCAAATGT ACGTATCTGT ACAAAACATG CTGCCCTTCA 

7 851 TTTTCACTTG CTATCTTCCT ATCCATGGTC TTGTTTGGAG AAACCGACTA 

7 901 ATGTTGCAGG ATGCTAAAGC TGGTAGACCT CTCCTTCTGG CTCACTATGT 

7 951 CTAAGCAGAG CCAGATATAG CTGGGAAACT TTATATCCTT CCCTCTAGGA 
8001 CTCAGGAAGG AAGGATCAGA GATGCTACTC AAATGGGCAT AGAACCTGTC 
8051 CTGCTGCTTC CTGCCTGTAC CACCCAACAT TCTATCCCAA CATTCCTGCC 
8101 TGCTAGGGAA AGCATGACCC GTTCCAGAAA TAGGCAGGTT GTGTTTTCAT 
8151 AGTTCTTTTG TAGGTCATAC CCTTGGTTGN NNNNNNNNNN NNNNNNNNNN 
8201 ATTCTTTGTT TTGTGTTCTT GACTCTTAAA CAGATCTCTA GCATATTGAA 
8251 AATTCAACAT TTGATTTTCT AACTGTCATG GGCTTTACTT TTATTGACTA 

8 301 TTGATGTGGC TGTTTATTGT CAGGTGAAAA TTTTTTAATA TGTTCACACA 
8351 CTGATGATTG CATATTTGCA GCACACAGCA TCTTAAACCA CTCAGAGGTT 
8 4 01 TGTCACAAAA TGTGTGTTTC TTGCTCTGTA ATTTTTTTGT CATTTTGATG 
84 51 GCATATTTTA ATTATGCTTT TATTTCTCTC CTTCTACCAA GTGGAGACCT 
8501 CTGTAACAAA GATTTTTCAG GGTTGCATAT TTCATCTTTA AAAGCTGTGT 
8 551 ACACTGTGGA AACAATTGTT AACCAAAAGC ACTAGATAAT TTAGGATAAC 
8 601 TGAGTTCAAG TTCTGACTTT TGTACCTGAA TAGGTGTCTA ACATTTCTCA 
8 651 GCTACAGTTT TCTTATTACA CAAGCATATT TCCAAGGTAT GTTGAAGCTC 
87 01 TAAACTGCAA CTGAAAACTT CCTTAATGTA GAAAAACTAT ATAGGATCTA 
87 51 AATATTGTAT TTTTGTATGG TTGTCCTTCT GTTTACTCGG AGATTTGACT 
8 801 GTATATGTGG CTTATGACAA TAGCATTTTT GTTAAAAGCA TTTTATAGAA 
8851 GTGTGAAGAA AAACTAAAAA TACAACCAGT TCCAAGGTTT AACAAAACTA 
8901 TTCCGTTTCT GAGTTCTTTG GCTGTCATTG AGCAACTTGT GGTTTCTGAA 
8 951 GGAAATTATG TGAATTAGGA TGGTTTTGTA TCATTTATCC TTAAGAACAG 
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• 



9001 
9051 
9101 
9151 
9201 
9251 
9301 
9351 
9401 
9451 
9501 
9551 
9601 
9651 
9701 
9751 
9801 
9851 
9901 
9951 
10001 
10051 
10101 
10151 
10201 
10251 
10301 
10351 
10401 
10451 
10501 
10551 
10601 
10651 
10701 
10751 
10801 
10851 
10901 
10951 
11001 
11051 
11101 
11151 
11201 
11251 
11301 
11351 
11401 
11451 
11501 
11551 
11601 
11651 
11701 
11751 
11801 
11851 
11901 
11951 



GGAAAATTGA 
TGAAACCTAC 
GCTGCCTCTT 
TAAACTTACA 
TGAAGCTGTT 
CTGGCTAATG 
TTCTTTCTGC 
GATTTAGTAA 
CTGGGTTCAG 
CCTAAACTGG 
ATATATTTCT 
AGAGAAAATT 
ATAGTGTTCC 
AGGTCAACAA 
GTAAAGCAAT 
AAAAGGGGTG 
GAATTCTGGG 
GTAGTAGCTT 
TGTTTGTTTG 
ATTACTTGCA 
TTTGATTAAG 
CATCATGTAC 
AGTAAGATCA 
AGTATGTGTA 
TATTCTTTTC 
ACTAATTTCC 
TTTATGGATT 
TAAAGGGAAG 
AAATACCTGC 
ACTGGGACCA 
TTGTGGAAAT 
CCTCCTGTGT 
AATTCATGTT 
AGATGATAAT 
TAAGGGAAGA 
AAGGGCAAGG 
ACTGCCACAT 
TTTTAAATTT 
GAAAATGTAT 
TTAGTTTTTG 
GCAATAGATC 
ACTATATACA 
GAAAGGTTTT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCNNNNNNNN 
NNNNNNNNNN 
AAAAAACTTT 
TTCTGCAAAA 
TTTTTATCAA 
GGGGAAAGAC 
TCCAGAGTCT 
TCCCAAGAAA 
TCTGTTTCCC 
TGAGATGTGT 
TACTTGTAAG 



GATGTTTTCT 
ACCTACAGAT 
CCACGTAAAG 
GGTTAGTGAA 
TTGGAGTGGT 
ATTGTGGATA 
ATTTCAAATG 
ACAAAATATT 
ATACTCAGTC 
TTATTAACTT 
TTGATTATTT 
AGAGCTGTTA 
CTCCACAGAG 
CTATGACAAC 
TTTTATACCT 
GGGATTGTAC 
GCCAGCTTGG 
CCTCCTCTAA 
CTTTTAGGTA 
ACATTAAACT 
ATTATTTAAC 
CTTTGGGTAA 
GCGTTGAAGA 
GTCTTCAAAG 
CCCAATCAGA 
TGAAATCCCT 
AGAAATTTTA 
CACTCTCCTC 
CACAGGGATA 
AGAAGGCACT 
TTTTGATATG 
GTAACTTCCA 
CTTTGCATTT 
TAGTGAAGCT 
TATTCATCAC 
GAGAGGGATT 
GCAGACACTC 
TTCCACCAAT 
TCATTTCCTT 
GTATACTTTA 
TCTCCCCTAT 
CTGCTTTGTA 
ATTTATAGAC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GAACATATGT 
TAAATTTTTA 
TTGCCCAATG 
TGAATGGCGA 
GATCTTTCCT 
TCCCTCTGAC 
ATCAGACCAG 
CCATTTGCCA 
AAAGAGCCAC 



TATGTTTCTG 
TGACCTTGCT 
TGAAAATTTT 
ACATAGGGAC 
CAAATAATTT 
TTGGAAGTGA 
AAAAGGCTAT 
TAAATTTCAT 
TTATAAATGG 
CCTGTCCATA 
GGATTTTGAA 
GAATGTTAGA 
AGGTTATACT 
TTCCATTGAA 
GTATGCACCC 
TGTTTACATG 
TGGAGCGGAG 
TTTTGAGGTG 
GGAATAAAAG 
TGAATCCACA 
TGCCACTTTC 
CTTCAAGTGG 
ATCTTGACCT 
TTAAGTTTCA 
ACTGATCTTT 
TCATAAGCTT 
GTACTTCTTA 
TCTGCCAGTT 
TGCTGCTTGA 
TCTGGTGCCA 
CTTTAAAAAA 
GCATTTCTTT 
TGAGTTAGTT 
CAATGATTCT 
AGGGGTGGGC 
CTGATGGCCT 
TTGCCGCTTT 
CTTTGGAAAA 
AAATATCCCT 
AAAAATATTT 
CACCCACTTG 
TGCTGCTGCT 
CTTCTTACNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNC 
NNNNNNNNNN 
NNNNNNNNNN 
TAAAGATGTT 
TATGTGGTCA 
ACTTCTTACT 
TGCTGACCCA 
TGATTTTAGG 
AACTCAGTGA 
AGGCAATGGG 
GGGTCTCCCA 
GCATTCCTTT 



CTGGAGATTT 
TAGTTAGCTC 
GGATTCTTAT 
TGAGATATAG 
TAGTTGGATA 
TGAAAAAATT 
TAGTTTGAGC 
GTTTCATTTC 
AACATGATTT 
ATCACAAAAC 
TACTCTCTTT 
ATTTGTTTTG 
AGTAAATGCC 
CATGAGTTAG 
AAAAAGTAAC 
TATATTGAAG 
TACACTTCAT 
AGAATGCAGA 
CAAGGCAAGG 
AATCCTTAGG 
CTTGAAAGGT 
TCTTGGAATG 
TTTCAAACAG 
GAGAAATTGC 
ATGTAATACT 
AATCTGGCCA 
AGCTAGAAGG 
CAGCAACTAG 
AGTTGCCCAC 
GAAACAACAC 
TGTAGGTGTC 
TTGTTGCTTT 
GAATCCACTG 
ATGCAGTGTC 
CTTCATCACA 
AACCCTCATT 
CCCATTGATT 
TATTAATTTA 
CCTCAGAGCA 
TACCTACTTA 
TTTTGTACAA 
TTTTCTTAAC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
CTCCTTACTT 
NNNNNNNNNN 
NNNNNNNNNN 
GTTTTGCATG 
AATGGCAGAT 
CAGTAGAGAA 
AAGGCTCCAA 
ATTCTTTGAT 
ATGTCCATCT 
CTGAAGCTAG 
GTGGCTTCCT 
AGGAATTGCT 



TGGAAAGATG 
TGAACCTCCT 
CGGCTTCAGA 
TAATTCATTC 
ATATATATTA 
ATTGAATTAT 
AGAGAATTTT 
TTTCTCCTAT 
ATTTTTGCTC 
TAT AT AG ATT 
AAAATAATCA 
AAGGCCACAC 
TTTCTATTTG 
TATTTTAAAC 
AGGGGCTCTC 
ATTGCTAGCA 
TGTCCTTAGT 
ATCTGTTTTT 
AATAATTTTG 
AAGTGAAGTT 
TGTTTAAGAA 
CAGATTCCAA 
GTAATTTGTT 
TTTGCTCTTT 
ATTAAGATCT 
GGTCTTAACC 
CCAGGCCAAC 
ATCTGTCCTC 
GCATGTACAC 
TGTGTTTGCT 
TTCTCCCTCA 
TTATCTACAG 
TATGCTTTCC 
TTTCAGTGGG 
GGGGAGATGG 
ATTGCCCCCC 
TTGTGCAATA 
TTTTGTAGGA 
ACAAGCATGA 
TGGGTAAATT 
ATGAAAGCAT 
TGTACATGTT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TTTTATGCAC 
NNNNNNNNNN 
NNNNNNNNNN 
TTCTTAGTGT 
ACATTTGTAA 
GATAGTTTAA 
CAATCCCATA 
CCCTTTTCTT 
GCTCCCATCA 
GAGAATGGAG 
GCCTATCATT 
TAATTCATTG 
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ATCTTTTATT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNTCAACTGA 
CTTGNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
CTTTGTGTAT 
GGAGGAATAT 
TTATGATAAA 
ACTTTTTGTA 
ATTCTAATCC 
GTAAGAAATG 
AGACAGCAGT 
AGTTNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TAGATATGGA 
AATTGAAGAG 
TAATGTAAAG 
TGTTCAGTCA 
AATTAAATTG 
GAAAGTATAT 
TAGGGGCTAC 
GGTTTTTCAA 
AATTTGGCTA 
TTTAATTTGT 
AATTATGGTT 
TTTACGGAGT 
CTATCATGAA 
GTCAACATAA, 
GTTCTATACA 
GCTAGATACA 
TTTGGATGGA 
TAACAATGTT 
CATTTTGTCT 
CTAATGGAGT 
CCGCCTAGGC 
TTTTTAATAT 
CCATTTTGCT 
AGGAGGAAAG 
CTGATACATG 
CAGTGCTTAT 
CAAAGTAACC 
TTTATATAAG 
AATACTCTGC 
TGGGGTGTGC 
TAGAACCTGA 
AAAAAAAGCA 
GAGTGCAGCA 
AGGTCTTCAA 
CTGTCTGAAT 
GCATTGCCAG 
AGTCCTCAAA 
TTCAAAGTGA 



ATCTTTGTAG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TGGAGTTGAT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
AGTATGGGCT 
GTAGAATTCT 
AAGATTTTCA 
GTGTCCTTAA 
TTATTCCAAT 
CATGATTAGG 
GAAGGTAAGT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GGACATATCT 
GTTTCATTTT 
GAGAGTAAAT 
TCTTCTATTT 
ATTTCTTTAT 
AAAGTGGCTT 
ATGGAGGAAA 
ATGCAGATAT 
CTAGCTTCTG 
GTTGTTTGTC 
TTACTGCATG 
GGGAGTGAAT 
GCCAGTGGTG 
GATGAGGCCC 
TAGTTTAAGA 
GCCCAGAATA 
CAGCTCTGGA 
CTGAAGTTTG 
TTCTTTTTTA 
TGATATCCAG 
TCCAGTAAAA 
CAATTTTGTA 
ATATACATAT 
GCATGAACAT 
GATATGGGCC 
TTCCCCAAAG 
TCAACTGTTT 
TTGCTTAGAC 
ATATCAACAT 
GGAAGGGCTG 
AATCCTGATG 
AATAGTGATT 
ATCCAGTAAG 
ACGCCTTTTC 
TGTCACCGTA 
GGCCGAGGTT 
GATAGTAAAA 
TGTCAATAAC 



TTTAGCCATA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ATCCAGAGAC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 

GAGATCTGAA 
GGAAATATTA 
AACCATAGGG 
TATATACTTA 
TGAAAAATTA 
ATTTCTAACT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNCA 
AAAGACATTT 
ATTGCTACAA 
CATGGCTAAT 
TTGATTCCTT 
TTCTATTTGT 
CCAGGAATTC 
ATGGGGGTAA 
GTAAGCCATA 
GTAGCCAAAG 
AGGGAAAAAG 
TAGTTCTGAA 
GATGGGATGG 
GTTTTTGAAA 
T TACT CT AAA 
AGCATTATGT 
TCTAGCTGTG 
TTAGACAAGA 
CTGTATTATT 
AGTCGATATA 
AGACATTGTG 
GGAGGATCTG 
TCCATTGGAT 
TCATTTCCAT 
TGTTCCCAGG 
AGTGAGCCCA 
TTCCTGATTG 
TCTTAATTCA 
CTGAACAGAC 
CCCGTACCCA 
ATTCTCATGT 
CACATATACC 
TTAAAAATTA 
TGTCACATCT 
TGCAGAGACT 
TATCAACGTG 
TTCTGTCTTC 
TGTAGCCAAG 
CAGCTACAAC 



ANNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
AAGATACAGA 
NNNNNNNNNN 
NNNNNNNNNN 
NTTAGCCATA 
CTAGACTTAT 
TACCATTTTG 
AGACATTTTT 
GCATATATTT 
AATTGTCTCC 
AAAGACACAT 
TTCCTGTTTT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCATCACTGT 
GATTTCTGTA 
ATTTTCTTAT 
ATTACAGAGT 
AGAAGATACC 
TTATGCCCTA 
ACACAATTAA 
AATAAAAATA 
GGATCCTACA 
GTAAAAGATG 
CAAACAAGTT 
CGAAGACTTT 
AAGGGGCCAC 
AGCTGTTTCA 
AGTACAGTTC 
GCATAGCTTT 
ATGGATGAAC 
TAGTTTCAGG 
TAACAGATTA 
TTTTGAAGAT 
AATAGAGCAG 
CAAAGAAATT 
AGCAATATAT 
GCCAGTTATG 
CATCATTTAC 
GGAACAACAG 
CTATAGGTGT 
CCTGGATTTA 
TCAAAGCAGA 
AACCTAAGTC 
ACTCTTGAAG 
CTAGGATAAT 
TGACTCAATG 
CCTTGGGAGG 
AGCGTGAACC 
GCAGCGAGCT 
ACGGTAACAA 
ATAATTTGGC 
CACAGAAAAA 



NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GATTCTGTAT 
NNNNNNNNNN 
NNNNNNNNNN 
ATTTTTATAC 
AACCTATTAA 
GGGACACCGT 
GTACCAAAAT 
TCTAATAGTT 
TTTTAGAAAA 
ACTAAGAGCA 
CCTGACACCT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TGTATACCTT 
TTAGTATAGT 
GTTTAATCTG 
AGAAACCTTC 
TTAAAAGATA 
TCTTGTTTGG 
AATAAGGAAA 
TTAGGAGGAG 
TATGTCCTAC 
TTATTGCTGA 
ATGTGGGAGG 
CACGTGGCCC 
AGCAACATTC 
TAGAAGATCC 
ACGGAAAGCT 
CTGACGTTCA 
TGTATAGGGT 
TTAGAATCAC 
AAGGCCAGTT 
CAGTAATCAA 
GGATAGGGCC 
AGTGGGAGAT 
AAATTCACCA 
GCTTGGAATA 
CCTACTGATG 
AATCCCTGAC 
TCCGGGGAGT 
TAAGGGTTGA 
GTCTGTAGGA 
ATCTTTTCAC 
CCCTAGGCAA 
TTCTCTCTCA 
CATATTCTGA 
AACAAAGAGA 
AAGAATCTCC 
GAGAAAGTTA 
ACAAATGATC 
TTGAAAAAAA 
TACTAGTAGC 
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ATATTGATGG 
ATACACCAGT 
AGTAATATCA 
TTTTAAGTTT 
GAATTATGGC 
TGAAGGAGTA 
AAAATAGTGA 
GAGAATTTCA 
ATTCTTAACT 
TTAGCTTTAG 
ATTATTTTTT 
GATACTAGAT 
GTTATTTGCA 
CTTCATTTGA 
TTCTCCAGAG 
AGGGTTGCCT 
ATGTCATCAC 
CTAGGAATTT 
AAAAACCTTT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNCTAA 
AGGAAAGCCC 
AACTCCATGG 
CCAAAGTGTT 
TACTTGGTAA 
ATTGATAATT 
TTTAGTGTAT 
GTGGTAATAG 
AAGTATTTTA 
TCTCTTGGGT 
GGTGTCTATG 
TTCTAACTAA 
AGTATAAAAT 
TTAAATCTTG 
TAAAGTAGTG 
TGTCCATAGG 
TAGGTAGTTT 
ATGAAAAATA 
AGCTATTGCT 
TGTGTCAGGG 
CTTTAACAAA 
TGCCTCCAGT 
GCATTTTATT 
TTTTATAATT 
CCTTTGGATT 
TGATTTTCCT 
CCTGTGTTTA 
GGTTTATGAG 
ACCACGTATA 
CAAGTTGCCG 
TGAAATGTTG 
GCAACAGATT 
ATGAATGAAT 
AAAAAAAAAT 
TGTTGTGATG 
TTCCCAGAAA 



TGCATCCCTG 
TATTTAAAAA 
AATACAAATA 
TCTGATTTTA 
CATAATAGAA 
TATTACAGGC 
CATCTATTTT 
GTTTAAACAG 
CTGATGGGAG 
AATATAGTTG 
ATTAGGAGAC 
GCCAGGGGAG 
TGTTGAGAGT 
TCAATATCTT 
CTGACATTAC 
TTTGTAGATG 
AGTTTAAATT 
TGGATTTATG 
AGCCNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
CCTTATTTTC 
CTAAGTGGTC 
ATGGACTTTC 
TAGTGCCTCC 
TTATCAGCAG 
AATAGAAAAA 
TTCAGTGTTT 
TTTCTACTGG 
TGAAGTTTGA 
TGAGTTTTTA 
TATGGCCAAG 
GGCAAAAATG 
TATGAAGAAA 
TGTTGAGGTC 
TACTGAAAGT 
GTCTGTTGTC 
TCTGTGAAGA 
TAGCATGAAA 
TTTTCTAGCT 
AGGACCTTTC 
GTGTGTGTGT 
CTGAGTCACA 
ATTTCTTTTC 
CTTTACAGTA 
AAAAAAAAAA 
AACACGGTCC 
GCAAGCAAGA 
TCACAATTTC 
GTAAAGAACC 
GAGAGAGTTC 
CTTGTATTTG 
ATCAAGAGAA 
CCTATTTTAG 
TGTATGTAAG 
GTTGCATATA 
CTTCCTAGAA 



GGAACATGCC 
TTGGTATGTT 
AACTTTTTAT 
TTACTTAATT 
TTCCTAATAA 
CAGACATGAT 
TTCATTCTGT 
CCTGCTGAAA 
GAAACAGATA 
TTGCTTTGGA 
TTCATTTTCT 
TGCAAATTTG 
CAAGATTTAA 
TTTAACCATT 
TTTGCTATAA 
TTATTCACTT 
TGTAGTAATA 
CTTACTTTGC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GTAAATCAAA 
AAAGGTGTGT 
CAGACATAAA 
TATTTTATTT 
AATATTTACA 
CAAAATCATA 
AGAGTTGATT 
ATAGATTATT 
CATAGATAAT 
CATCATTGAA 
AATTAAGACA 
ATTTTTGAAA 
TAATGTGTAG 
TGATGAATTT 
TAGGATCATC 
CTGGATGTTG 
GTTTGCAGGC 
ATTAAGGTTG 
GATGATTAAG 
CCTTTTATGC 
CATGCTGTGC 
TGGTAACTGC 
AAAGTTTACA 
AATGAAACTA 
AAGGCTCAGT 
AAAGTTCAGG 
AACATAATCC 
ATAAAATCAT 
TTCAGATGAA 
TAGAATTCTA 
CTACTGTGGC 
AATGAGACAG 
TGAATGTTTT 
GCATTTATTT 
AAAATTTCCT 
GTAATCTATT 



AGGGCAAATT 
ATAAGGTAAA 
TTTGATTCAC 
ACTACTGATA 
AATTTCTAAT 
AAAGCATTAT 
ATTATTTTAT 
TACTGTTAAA 
TGAATAATAA 
GAACAAACCG 
GCCATACATT 
AGAATAAGGA 
CTAAATTTTA 
CTGTAGATTA 
CATCATCAGA 
ACTCTGAAAT 
TATACAAAAG 
CAAAAACCAT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GTTCCTAAAT 
GTTGCTGAAT 
CTTCCCACCA 
TAGTGTTAAG 
TACCAACAAA 
GCTTATGTTT 
TTATATTGCA 
TCAAGTTCAC 
GGTGGTTTCA 
GAATTAAAAG 
TTCTGATGTA 
CAATTACTTT 
GTTTGCTAAA 
TTTTATATCT 
CATGTTGCTA 
GTGTTATAAA 
TAATATCACT 
GGGTATGGAG 
GTTGAGAGGC 
ATGCACCTGG 
TGCTTGCTTA 
ATCAGCAGTC 
CTTGGCCTAA 
TACATGAGAA 
TCTTTAAAGT 
ACTGGCTCCA 
CCAGATAGTT 
CAGAGTGCTT 
CTGAAGCAAG 
AACTACCTGA 
CATTTTATGA 
ATTTTTCTGT 
AATGGGGTTT 
CTGGCAACTT 
TATCCTTCAA 
CCAGATTTAA 



GGGGTGCAAA 
GAATTCATTA 
TAAAACTTCT 
CATAATTAAA 
AAAGCCAGAA 
GATGTGTGGT 
AAATTTTCTG 
TCAACCTGTT 
AAATGATTTC 
TTTTATTCCA 
AGCTTTGGTA 
TTTGGCATGG 
AAACCAAACA 
ATAATATAAA 
TCACAAGATT 
GTTAGGAAAT 
GAAAACACAA 
GTTGATTTTC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCAGTTTCTA 
TTCAGAAACA 
AAATATTGTT 
ATTTTGTAGG 
TAGCAAAGCC 
AGTTGTATTT 
TATACTAACT 
ATGATTATAG 
GGGTCCCTTG 
ATTGTGTCTT 
CAGTTTCATT 
TTAGGATAAA 
TGTTGTGTTC 
TTAATAGTTC 
CTTAACTTGT 
TGTACAGAGT 
GTTTTTGACA 
AACTTTCTAG 
AGAAGTACAT 
CATTTATCAC 
AGGGGCTGTA 
TTAGCCTGTA 
TACTTAGACA 
GATGGGCATA 
TTCTTATCCT 
AACCCATAAA 
TAGTTTCTCT 
ATTAATTCCA 
GGTTCTTAGG 
GTAGCTTTGC 
TGGCCATAGA 
ATTATGTGCT 
ATACCGCAAA 
TCATAAAAAT 
TAGAGGATAG 
CATTGCTTTG 
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ACATAAAATG 
GACATTGGCT 
AACTTACTAG 
ATATGACCCA 
TGTTTGGCTA 
CTATTCTTTA 
TTTCCTGATA 
AAACGGTATC 
ACTTGTGTGA 
TTTGCTTAGA 
TCAACATCAA 
CCAATATGAA 
AAATACATAG 
AATCAAATTG 
CCCCATTCAC 
TTGTAAATTA 
GAAGAATAAA 
AACACTGACC 
CAAACTGGGT 
TAATTCTTGT 
TGTGGAAGGG 
AGGGAACATA 
AAAACTATTA 
ATGTGTTTTC 
AGAAATTAAT 
CAAAAGGTCT 
ATAACTTCAA 
CAAAATGGTG 
TTGTTTCAGA 
TAACAAGTTG 
GGAATTTCTT 
AGAGAATAGC 
CTTGAAATAA 
AATTTTACAA 
TTTTTAGGAA 
ATAGTAAGTT 
TTGATTATTC 
CTAAGCTAAA 
AAACAAAGTC 
AGGGAGTTGT 
GAAATATTTT 
GGCATTTATG 
TTTTTGGCAG 
AACGTTGGTG 
GAAGACAATG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GTCATGAAAA 
AACAAAATTG 
GGAGACTTAA 
CTAGTAGAGC 
AATGTGTGGT 
NNNNNNNNNN 
NNNNNNNNNN 
CTTCCACAGT 
ATACAATACA 



CAGTTTTGTG 
AAAATTTTAT 
ATCTATGTAT 
ACCTTAAGAT 
TTTGGTACTA 
TTAAGTTGTT 
AATTTGATGA 
CTTTTATATT 
TCCAGAATGT 
TTGTACCTGA 
GATATGGTGA 
AAAAAAAAGT 
CAATCATTCT 
GACTTAGAAT 
CNNNNNNNNN 
CAGTGGATAT 
ATTATCCCTT 
ATTATTAGTA 
TATTTATGTA 
TGCTATTGAC 
AAGAAAAAAA 
CAGTATGTAA 
GCCTTTCTTA 
CAACTCCAAT 
ATTCTGGAAA 
GAAATATTTA 
TTATTGGTTA 
ACTAAAATAT 
AAAGCACTGA 
ATAAATACCC 
TTTTTGGACA 
CAGGATAAGG 
TTTTGGCTAA 
AGCTGTAAAC 
AACTATATTT 
AATTTTAATT 
CTTTCTTCTG 
GGACAGTTTA 
CACCAAATGA 
ATTTCATATT 
TATTGTATGA 
CTCTTGTTTT 
TAAATAATGT 
AAATATGTGG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNGAGAATGT 
ATGTTTTTAA 
ATAGTACTAG 
AGCATATGTA 
TATGGCTTAC 
TGGCCACCAA 
NNNNNNNNNN 
NNCTAACTTT 
TCTTGACAGA 
AATGGAGAAC 



TAGTTTTAAA 
CTTGAAGTCG 
TTCAAGGCTA 
ATTACAAGGA 
ATACAACTAG 
TTACCTTTGC 
CTCAAAATTG 
TTTTCTGTCT 
AATTTGCTAT 
TGTATTTTAT 
GTTCTTTTCA 
CTAGCAAGCA 
GGATCAATAG 
TTTTTTCTTC 
NNNNNNNNNN 
TTAATCCTTT 
TTCTTACCTC 
CCATTTCAAG 
TGTTCAAGTG 
AACCAAAGCA 
TTATATTTCC 
GAAACAGGTA 
CTAATTGATC 
TTAAATAAAA 
TGAAAAATTG 
AATTTAGAAT 
ATATTTTTGT 
TACTATTTTA 
AAGTTGAATA 
AAGTCCAGCC 
TTTGTTTTGC 
TGCCAGCTCA 
TCTATGTCTT 
AAAACATTAG 
CTTAGAGATG 
CCATTATTAC 
TTTTATGAAT 
TGTTTTTAAT 
AATTACTAAC 
CATTAATTCT 
AATATGTTAT 
AGGTACATAA 
CACTTATCAC 
AATTGTAATG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCTTTAAACT 
GGCAGNNNNN 
AAATAGAAGT 
TTTTAAAGTT 
ATTAGGGAGG 
CATNNNNNNN 
NNNNNNNNNN 
GCAAGATTCT 
TGTTTGCTAT 
TCTGAGCCGA 



ATGCAAATTA 
GG TAT AT ATT 
ATTTATGCCA 
TAAAATAGAC 
TTAGAACATA 
TTACAATAAT 
GCAATTAAAG 
CATATATAAC 
TTACACTTTG 
TATTCTATTT 
AAATAACAGT 
AATGTAATTT 
TTAAATTATT 
TTATATTACC 
NNNNNNNNNN 
AAAGGCATTA 
TAAAATCTCT 
TTTCTTCCCT 
AAATGGCTGT 
GCCATGCAAG 
TCCCCAAAGT 
AGTAAAAATA 
AAAAGTTTAA 
GGATACTGTC 
GATTTGAAAA 
TTGGACTATA 
TATGAAATGT 
AACATGTCCT 
TGTGTAAGTC 
TCCTTACAGA 
AGCTTTCTAC 
AAGGTTAGAT 
GAAAGGCATA 
TTGTGTTTTT 
TGCTTATTCT 
ATTAAACTTT 
CTAATAAATT 
ACTTAAGGAA 
ACCAGCCTAT 
CCATGGAACA 
CAGTTATAAT 
AGGGTTCCTT 
CCTATGACAC 
TTTAATGGAA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GATTGAAGAA 
NNNNNNNNNN 
TCAGAAATTT 
ACTAGCAGTT 
CAATGCCAAG 
NNNNNNNNNN 
NNNNNNNNNN 
TACTGGTTAT 
TATTAGTGTG 
GTTTTACTAG 



AAAAATATAG 
GATACCATAA 
AGTAGGAAAA 
TATAGAAAAC 
ATGATGTTTG 
TTAAAGTATT 
AATATTAAAG 
CATAGTCATA 
ATTTCAGCTG 
AAGGAATGTG 
AGAAACCTGA 
GTGTCTTTAA 
GCTTCAATTA 
CAAAGGAAGG 
NNNNNNNNAA 
ATTTAGCAGA 
AGGTTGATCA 
TTACTTTCAT 
GCTTTCATGG 
AAGAAAATGC 
TGGAGAGAGA 
TATCAATTTT 
ATTTTTTCAG 
TCTAGGAATC 
TACAGCATCA 
GAAGAATAGC 
TTTTTTATAA 
AGATTTTTTT 
TCCGGGAATG 
GAAAAGATCT 
GTCTTAAGGC 
GAACATATTT 
CCTTCTAAAT 
GAATTGCTTC 
ATACATAATT 
TCCGTTTCAG 
GACCTCCATT 
GCAAATATAA 
CATTAACCTG 
ATTCAGGTAA 
AGCACTTACG 
GTTTAAGATA 
TTTAGTGGAA 
TTGATTCTAA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ACTATCACAT 
NNNNNNNNGG 
CTTTGTTCAG 
ATAGGATTGA 
GAAAGAATAG 
NNNNNNNNNN 
NNNNNNNNNN 
AAGGAGGTGG 
TTCTCAATAT 
GATCCATCCT 
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21101 CTAAAATCAA AACTTTCATG 
21151 TCCCATTTCT CTAATAATAT 
21201 TGGTTGTCTA ACAGTGTTTG 
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214 51 AGGGATAAAG ATTTGAGGGA 
21501 AGGATCAAGG AAGACTTCAT 
21551 ATATATATAT TATCAGTATA 
21601 ACAGAGGAAG GCAATACTGG 
21651 TGAGAGCATA TCTTAAGGGG 
21701 TGATGGCCCA GCTGAATCCC 
217 51 CTGGGCACAC AGAAAAATGC 
21801 ACATTCCGAG AACTGTTAAG 
21851 TTTGAAGGGC AT CT GAT GAG 
21901 CTGGAAGGCT TTTGATATCA 
21951 TGTAGAGGAA CACTGTAAAG 
22001 GATAGCTGGT GGTAATGGAT 
22051 GCTTGGAGTG TTTGTAGTTG 
Q 22101 GGTTGGTTCA AAAATCGGTA 
% 22151 TGGGGCCTCT TAAGTAGCTG 
' ! ~ 22201 CCCTGTGGAG CTCACTTCCT 
22251 TCAACCTGGT GAGAAATTAG 
N 22301 AAGAGAAGGA GAGGGATTGA 
13] 22 351 CACAGTGTTA GGGCAGTGGC 
224 01 TGNNNNNNNN NNNNNNNNNN 
!«f. s 22 4 51 NNNNNNNNNN NNNNNNNNNN 
22501 NNNNNNNNNN NNNNNNNNNN 
'H 22551 NNNNNNNNNN NNNNNNNNNN 
vi 22 601 GTTGTGAAAG GTGAAGAAGA 
U-l 22 651 ACATCATTTG GTGAGCTAGA 
|«i 227 01 CAATTTTGGA TATGTTCAGT 
| ,B f 22751 AAACTTGATA TGTGCTTGGA 
H 22801 CTATGATTTC ATTTCTTTCC 
\\ 22851 CTTCGTCAAC TTCAGTCAAC 
H 22 901 GGAACAGCTT TAAACAGTAG 
f\ 22 951 GTTCCACATC TGCTAAATGT 
pait 23001 ATCTTTCCAG TTTACCGTTT 
23051 AACATTTGTT TATATCAAGG 
23101 TTTTCTTTTT TTCCCTCTTT 
23151 CACTCCTAAC TCTAAGAAAA 
23201 GTGAATTCGT ACNNNNNNNN 
23251 NNNNNNNNNN NNNNNNNNNN 
23301 NNNNNNNNNN NNNNNNNNNN 
23351 NNNNNNNNNN NNNNNNNNNN 
2 34 01 NNNNNNNNNN NNNNNNNNNN 
2 34 51 NNNNNNNNNN NNNNNNNNNN 
23501 NNNNNNNNNN NNNAGCACAT 
2 3551 ACAATGTATT AAAAATGAGC 
2 3 601 TTCTCCAGTC TGCTTTCCTG 
2 3 651 CTCTACATAT AACACTTGCA 
23701 GGTTATTTAT GGATCTATAT 
23751 AAGGTGGAGG CAGTGTCTTA 
23801 CAATGCCTCG TCCAGTTCAG 
2 3851 GTCATTTCTA TTTATTAATA 
2 3901 GTAATCTTAA TAAACATATT 
2 3951 ATGAATATAT TTGTCTCCCT 



TTTTCAAATG GTTTTGTTTA CCATAATCAT 
GAAGTAGAAC AATTCTTCAG TTATTCATTT 
GGATAGTTTT CTGTGCTTCT GAACCGTTTT 
CCATCCATAA ACCTCCGAGT GGAACAGTTT 
TGCTTGGGAA ATGAATTGAA GGGAGATGGA 
CATTTAGAGT TCCTGTAATT TATCCGTNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNTTTG 
CCTTCTATGA TTCTTAAATA GGAAAAAGTT 
GTTCAGGGAA AGAATCATTT CCTGCTGAAG 
GAACTCAATG GTATTCAGTT GATACTATAT 
TCTAGTATAT AGTATCAGTA TTGAAAGCCC 
GTAAGAGGCA TGTTTTAGTA TAGGATTCTG 
CAGTAGCCCA GACCAGCAGT GATGGAGTGG 
GAGGGATAAA TCAGTCAAGT GAAGAGTATC 
ATTTAAAGCT CAGTAGCCAG AGGGAGGAGC 
AATCTCAACC CAGCATATGT GGGGTATATG 
AGACGCAGTG GGAGAAGTGG GAAAGGACTC 
TATTAAATAC ATTTGCCTTA ATCACAAAGT 
ATTTTAAGCA GAGAAATTAA TATTTTAGAG 
CAGATCTAGC ACTCAGGTGA ATTTAGTTTT 
TCAGTGTCTC ACAGGAGACA CGGATATAAG 
GGTCTGCATT GCTGCGAGGA GTCTATCAGC 
GCCCATTTGG AAGAAGTATG TGCCTCATTT 
TTAGTGCAAA TGCCTGGGAC CTGTAGGAAG 
GCTTCATTTT AAGTAGTGGT GGTGGGAACA 
AAGATACTGA AAAGAAAGGA TGGAGAGGCA 
GGGGGGGCAC ATGGGCTTGG CAGACAGACT 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNCAGTATT TGTAATGCTG 
TGTAATCAAG ATTACTCCCA AGTCTCTAGG 
GAACACAGGA GAAGGTGCAG ATAATGATTT 
TTGAAGCGCT TGTGAGACTT TGAAGTAGAG 
TTGCAGGGTG AGATCTGAAC AGAACACAGG 
ACTATTCTAC TCAAACCTTC CTTCCTTTTC 
ATTGATTATT GCCACAACTA TCTCCACGGA 
AAAAAGAATG TGGGATAGGT AAAAATCTGT 
TATCTAATAG TTGTAAGTTC TTGAAAAAAA 
TGGCAAAGCA TTGTAAGCAA CTAATAATTC 
GCTGAAAAAA AGTCATCCTC TACAAACATA 
TGGTTGCCTT TTTTTGTTTT ACATCACCAC 
TATTTGACTA AAAGTGAATC ATTGTTAGTA 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
TTATTTTTTA AGTTTATATT CTGTCCTAAT 
CCTGCCTCAA GTCCTAGCCT AAATGCCACT 
ATTTTCACAG CAGGAAGTCT TCTGTACTTC 
TGAAGCATAT CTCACATTGC CCTACATTGT 
TATTTGTTCT GCTAGACTAT TAGTTTCATG 
TACATCTTTA CGCCAGTATA ACAGCTTCTG 
TTCAGTACAC TTTTGAAGGT GTAAGAGCTA 
AAAGGGATTT AAAAAAAAGA TGGACAAGAT 
GTGGTTTTCA GGAAACAGTT TAATGAGAAA 
ACCTTATGAC CTTATTACAT ATCATAGATC 
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TTCACTCTTG 
CACATAAAGG 
GTTCACAATT 
GCTGACATTG 
GCAGTGGACG 
GCTTTTCTTA 
TGGTGAATTT 
AGTTTTTCCT 
TCTTCTGTAT 
AGGAGACTGT 
TCTAGTGAAG 
ATATATATAT 
GTATGCACAG 
AGGCGTGAAA 
CTGTCTAGAT 
AAAGTGTGGA 
TATATTTGTG 
ATATCATGAA 
AATGATCTAC 
GAATGGCATG 
AGGCCTGGAT 
NNNNNNNNNN 
NNNNNNNNNA 
TTTAGAAAAC 
ATCTCTGACA 
NNNNNNNNNN 
CATAGAGATT 
TCTGGCCTTT 
CAGATGTTTT 
ATTTCATCAT 
TTCAGATATC 
ATCTTTTCCA 
CCTGTTCTGT 
TTCTGTTTCT 
CCTATTCATT 
GGCGTTTCCT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ACATTGAAAA 
ATTCATGTTA 
NNNNNNNNNN 
NNNNNNNNNN 
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CTTTAGATTC 
TAATCTGATT 
ATGGGTAGAA 
CCTTATCAGT 
GCCCTTGGGT 
TGCTGCTCAG 
CTACTTATCC 
GGTATCTTTC 
TCTTTTAACA 
ACACTACAGA 
GAGCTAGATA 
TTAAAAGTAA 
AGGCTTATGA 
ATCTTGGGGG 
ATTAGTGTAG 
GGTACTCGAG 
AATGTGATTG 
ATATGGTACT 
ATGAACATGA 
CTGAAATCAT 
TGAAGTCCAT 
NNNNNNNNNN 
AGGTTCTTCA 
AAATAAGCAA 
TCATATTAGT 
NNNNNTATAC 
TTGGTTGCTA 
TGTGATGATT 
AAAAAATGAT 
ATTCTATGAA 
TATTATATTT 
AACCTTTTTG 
GTCCTTAACA 
CCTTACTAGA 
TTTATATCCC 
CCTTGTCATA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GTTAACAACT 
TTGTACTCCT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
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TAAACAGAAA 
GTGTCATCTA 
TCTTGATTAT 
GGAGGACTGT 
TTCCAGACTT 
TCTGAGTCAA 
TTTAAAATTC 
CTATTCCTGC 
CCTTGTACAC 
GCAAGTACTG 
AGAAAAACAA 
ACCTGTATAT 
AAGCACTGAA 
TGATATAATG 
CAGACAAGAG 
AGTGTATTTC 
GAAATGAGGC 
GAATGCCATG 
TGAGGAGCCA 
GGGTTAGGGA 
GATGTTGGAC 
NNNNNNNNNN 
ACAGATAAAT 
ACTAAAAACA 
CGAGGCCCAG 
TTCAACTCAT 
AAAAAGGTGA 
CTTTAGCAAA 
AATTTTCAAC 
TTCTTACATT 
CCTTTTTGAA 
ACTTTCCTAA 
TGCATCTGTT 
CTTACCTGCT 
CAATTTTTAA 
CTTATAATCC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ACTCACAAGT 
TTTTATATAT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
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NNNNNNNNNN 
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TCAATTTATG 
CTGCTTACAG 
TTGAGTAGAA 
TTCTTGTTGT 
CTGCTGCTTC 
AGGCCCTTCA 
AGCTCACATT 
CTCCCTGCAG 
CTGTCATGTC 
TGTATTTAAA 
CTCTGAAAAT 
ACATATAATT 
GAGGGTATGA 
GTCAAACTGA 
GGGGAATGGC 
TAAGAACTGC 
CAATGGGATA 
CTGAGTGTTT 
GTGAATGAGT 
GGTCCCTCTA 
ACAGAAAGAC 
NNNNNNNNNN 
TTGGAATAAT 
CCTGAAGGTC 
TAACAAAATT 
GTATCAGTGA 
TGATGGTTAC 
CCTGCTCAAA 
TGAAAGAAAT 
ATGAAAGATA 
AGAATTTTTT 
GCCTGTGTAA 
ATCAGGACAT 
GGAGGATGAG 
TATAGTGTCT 
TCNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNCCTT 
TAAATAATAG 
TTCTTCCATG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
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TGCTTGAAAT 
TCCATTGGCT 
GCCTTTTCAG 
AGGCCCAGTC 
ATGCCTGTTG 
CCCATTCCCA 
TCCTCTTGAA 
AGAAATGCCT 
TATCTTTGTC 
GAGCTCAAAG 
ATGGTATAAG 
TATGATAGAA 
ACTCAGCATG 
CAGGTGAAGC 
ATTCCATGTG 
AATTAACTGC 
GGCAAGGGTC 
GAAATAGTAT 
ATAAGGAGCA 
ACAATAGTTC 
TGTNNNNNNN 
NNNNNNNNNN 
TATCATTTTC 
AAAATGACAA 
GNNNNNNNNN 
CTCTTTTAAA 
ATTAAAATTT 
ATTACAGTAT 
ACTGTAATGT 
TCTTTTATGT 
CTAGCACTAC 
TTAATCACTG 
CTCTGCTTAC 
AACTTGCTCT 
TAGACACTGG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCTCTAAATT 
TTGTCTGTGT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
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NNNNNNN MNN 
NNNNNNNN NN 
NNNNNNNN NN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
CTTAACGTTT 
ACAGAAAGAG 
TCAGAGGATT 
TAAAAGCTAA 
GGGGGACACT 
CTTGGTAAAT 
GTTATTGACA 
TATTTTAACT 
TGACTATGGT 
GTGAAGCCCA 
GGTAATAATA 
AACAAGCAAC 
GCTACTTTAG 
CGCTCCTTTT 
TATGTTTTTA 
TTAGTATTTT 
ATGCTGGACT 
AGCATGGAAC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TGATAACAAT 
AGATTTGACC 
TGGAAAGATG 
TT-TGTTCTGC 
CTGTACAATC 
TTGTTTCATA 
TTTGAAAAAG 
TGTCCATATC 
AAAGTCTTCC 
TTTCTCTGTC 
TTGTTCTGCT 
GAAAAAATTA 
TGAGAAAATC 
TTGCCTTATT 
CCTTAGTTTT 
AAGAAACATT 
TGATAATTCA 
AGACCCAGTT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GGATTTTTAA 
ACATGTGACA 
TTTTTCTAGT 
GATTATGCTA 
TTGGACTTAA 
TGATTATCTG 
TAAACCATAT 
CTTCTTGTTC 



NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TTCACATTTC 
TATTTGGCAT 
AAAGTTAAAA 
TTTTTTTAAG 
CTTGAGATGC 
TCAGGAATTA 
GTGTCTGAAT 
GCTCTTGGGA 
CTATGTTCTA 
TTGTGGTTAA 
AAGCAAACAA 
ATATGCTGCA 
AGGGTAATGA 
TTTTCTTAAG 
AAGAAGTATT 
ATCATTATTA 
GTTATAATCT 
TTCTGGTCTT 
TGGGATAAGA 
NNNNNNNNNN 
NNNNNNNNNN 
TGAGTTCTGT 
CAGAGGCATT 
ATCAGATAAT 
CTTTCCCCTG 
TCATGATATG 
TCCCTCCATT 
TTGCCTTCAG 
TCATTCCCTG 
TGAATTCTGT 
AGTTTCATAT 
CCCTTTTCTT 
TTTAGTGGTG 
CAAATGATTC 
CTTCTCAAGT 
ACTTTTCAGA 
CAAATGTTGG 
ATAGGACTGA 
CTGCTTATGT 
ACATCACTTA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GACTAAAATT 
TTAAAGAGCT 
TAAAACAATG 
CAGGAATGCT 
TTCTGGCAAC 
TGATTTTTAC 
TTGGATGCTA 
TGATCCACAC 



NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNTAC 
CAAATAAATA 
TAATCTAAAT 
TTTCAAGTAA 
TGAAAAGGGA 
ACATATACAG 
AAATTTAAAT 
TGGATGTTCC 
GACAACATTA 
AATTACCATT 
GTTCTTAATT 
CAACAAAAAT 
TCAATGGTTC 
AATTATTTTC 
TTATAGATTG 
TAAGTTATTT 
ACATTCTCTA 
CACTCTCCCC 
TTTCTGACCA 
CATTTTTCCN 
NNNNNNNNNN 
NNNNNNNNNN 
ATTCATGAGG 
ATTATTCAGT 
TGACACTATA 
GAGATCTTGG 
CAATGGCTCA 
CCACCCAGGT 
TGACTTTCCC 
TCATATTTTC 
GAAGCTTGTT 
CTATTTGTTC 
CTGTGAGCAT 
CCATGTCTGC 
TTGGAATTCT 
TACTAACTAC 
GCCTCCATCT 
CACTTTACTC 
TGTGGGAGTT 
TAGAGTTAGA 
AGATGGATAN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TTAGATTGCA 
ATGTTTGTTT 
ATTTAGAGGT 
AGAGAGCAAA 
TGATTTTGGG 
CTGGACTTTA 
GTGATAGCAG 
GAAGGTCAGA 



NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCTACTNNNN 
GTAACACTTT 
TCCCCTCTAG 
ACTGAAAACA 
TAGTGCTTAA 
ATGCAGCCAC 
AGCGAGAGAT 
CAGACACAAA 
TAAAGACTAG 
GAGTAATTGA 
GCAATAATGT 
TATAGCAATA 
AGAATCCAGC 
TTTTCAAATA 
TGGTCTTAAC 
AAATGTTACT 
CCCCTCTCAT 
ACTTCCAAAT 
CCAGAAGAAG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GCATTTAATG 
TTTTACTGCA 
GTACAGGGTA 
ACAGTAGGAT 
GGGCACTTTT 
CCTATTAATA 
TTTTCCAAGG 
TTGTTCTGCT 
GGGTGTAAAA 
TTCTGCCCTA 
AACCTGGAGG 
ATAATAACAT 
AAAGATAATC 
CATTGAGATT 
GCTTGGGTAT 
CCAGCATGAT 
TACTTTTCTC 
AACTATTGTC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ATCAAACAGA 
TGCCAGTTGA 
TATTTTGAGG 
TTTTGAGAGT 
GTAAAATGTC 
CTTAAGCTTT 
ACCATCAAAT 
GAAGCAATGC 



NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TTAATGTGTT 
CACAATGAAA 
ACAATAGTGC 
TTATGACAAA 
TTATTTTGGC 
CAAATAGATA 
AATAAAATGA 
ACACGTTATT 
CATTCGTATT 
TAAATAAAAT 
CTTCAACAGA 
ATCTACATAA 
TTCTGGCAGA 
TGCAACTTTT 
TAAATACTTT 
CCTTTCCTGC 
GCACTCAAAA 
TAGTGGATTG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TGTCTTATAG 
GCAGGAGTTT 
TTAGGAAAGC 
TCTTCTAGGT 
CAGAGGACCA 
TAGTAGTGAG 
CCAATTACCA 
CAAATGAATT 
ACAGCTTTTC 
CCCTTGCATG 
ACAAGGTTTT 
GATGTAGATT 
TTATTTCTAT 
GGATTGGCTT 
CTCAACATAT 
TACCTTACAT 
ATAACTTATT 
CAACTCTCAG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNGGAT 
AGAGGCTTGG 
AACTTGTTTC 
GCACTTTGAA 
GATTGCCCAT 
TATCATTCGT 
ATTAAGTTGC 
ACGGCCCAAA 
TGCCTTTTCT 
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GATAGCCAGT 
AATATTTTTG 
TTGATAAGGA 
AAAGGTTGGT 
TTATTGGAAG 
AGTTTTGAAA 
TAAAGTTTTT 
ATAAATTTGA 
TATCCTAAAT 
CAGACTAAGT 
GTATTTAGCT 
GGATGGTGAT 
ATTCTCAAGA 
GACATGCACG 
CCTTCTTACA 
CCATTTAGCC 
TAGGGAACAT 
NNNNNNNNNN 
AAGAATCCAG 
ATCTGCTGTT 
CTCACTAAGA 
TATATTCTGA 
CACTGCCTCT 
TGGTTCTTTC 
TTTTAGTTCC 
TGCCACACTT 
CGTATTTATT 

gctatatagg 
tttctaagaa' 
catttaatag 
agcctatcag 
ctaaacatgg 

TTGATGTGCT 
GGGAGAGGAA 
ATAAATTGAT 
GGGGCCTCAA 
GTTTTTAAGT 
TCCAAATTCC 
ATTTTGAGAT 
GATACATGGT 
AAGGTCATCT 
GTGATAGGGC 
TAAACTGCAT 
ATTAGAAGGC 
ATTGTTTCCC 
TTTCCAAGGG 
GTAGTCTTTT 
GGTGGTACAG 
GTAAAATTAG 
TGACAGTTGG 
TCGCAGGAGT 
GAGGAAACTG 
TAAAATGGGT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 



AGCACCAGCA 
AAGACCACTG 
AGGGATGACC 
TTTTGCTGAC 
GTGAGCCAGT 
TGGTTATGCA 
TTTTTACATT 
TGAGTTTTAT 
TTGTATCAAA 
CCAGATTCCA 
ACAAGTAACG 
TTAGGGTTAA 
TCGCCTCATG 
TTCTAGGCGA 
CAGCATTCCA 
AGAACTTAGT 
AGTTTTCCAT 
NNCCATTCTA 
CTGGCCTTTT 
TTAAGCACTT 
AGCAGTCCCC 
AAGTCTTTTG 
AACACACCAA 
ATCTCTCTAT 
ACAGCTAAAT 
GCCTACTATA 
GAATTCCATT 
AGAATATCTT 
GAAAGAAATT 
CCATTGGCTA 
ATGGAAGTGC 
AAGAAACATT 
CAAGATTAGG 
ACAATAAAAA 
GGTTCTCAAG 
GTTTAATTGT 
GGTATTTTTA 
AATTATTTTA 
GTTTACATTT 
TATCCAGGTT 
TTTTAGATTA 
CTAGAAATTA 
ATAAACCATC 
CCTCCTGGAC 
TTTGAATTTA 
CATTTCCCTA 
ACATTAAATA 
GGTCCAGTTG 
CAAAGAGGCC 
GGGAGGAAAG 
ATTAGAGTCT 
GGAGCATAGA 
AGGAACAGGT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NTGATTTTTA 



GGCACGTTGT 
AAAATGGATC 
TACTATTCAC 
CTCTGAGAGC 
TTATACAGAG 
CTTTCTAATC 
CCCCTGAGTA 
TTCTGTACTA 
AGGAAACCTT 
CAGTCTAGTG 
TGACACAAAG 
TATTATGACT 
GTCATAAAGA 
GAACACAGAA 
AAGCCCCACC 
CATTTGGCTG 
TACTAACCCA 
TATTAAAGCA 
GTGATCTGTG 
TACACTGTAG 
TTTGCTGCGC 
TTTCCTATTC 
CCTGCAGTTG 
CACAGCCCTT 
TTTCATACCC 
ATACATTATA 
CCAGAACTAA 
TTTGTGCAAC 
GATTGTATCA 
ATCTAAGGGT 
AAACAACAGT 
AACAGTGTTG 
GCACTCTGCT 
CTTTGGTCCT 
CTGGTCAGGG 
TTTATAAAGT 
AAACTACTTT 
AGGGGTTATT 
TTTTCAATAT 
GCATGCATAT 
AAAAGAATTT 
TGTAGAGTGG 
CGCCTAGTGT 
AGTTTATTTC 
ATGCCATGTT 
TCATTTACAC 
CCTCCTACAG 
GGATGACAAA 
CTGTGGAATG 
GTAGAAAAAA 
TAAAGGAAAC 
GCATATCCCA 
GTCAGGAGCT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCAGGCAGTT 



T TACT CC AAA 
ATTTATACTT 
AGAGTAATGC 
TCACATTACA 
GTTGGGAATT 
TAGTTATTTA 
TATGGTATTG 
GGATCTACTC 
TGCTGTCTTG 
ATCAAGGAAA 
AACAGTGCCC 
TCTTGCTCTA 
GGCCACTGTG 
CAAGCCGTGC 
CCATCACTTC 
CCCATATCTG 
TCCCCCACTC 
GAAGGGAAAG 
TCAGCCTTTC 
CCCCACGAGA 
CCCCCCCCCA 
CACTGCTCTT 
TAGTTATTAC 
GTTGTGGTTT 
TCTATGACTC 
TATATAACAA 
TGCCAGCAAG 
AGTTTCCAAA 
ACTTTATGAG 
TCCTGGTTAC 
TTGTTTTGAA 
GCCTGTTGGA 
TGAGAACAAA 
ATAAAGCACC 
GGTCCAAAGG 
GTCCTAAATA 
TTCATGCTTT 
TCCTGGACTT 
GGCATTCTGT 
ATAAATGTTA 
TTTTAAGCTG 
CTTACTTCTG 
ACAGTTGGCT 
ATTTCATGGA 
TAAAATCAGA 
TCTGCTTGTG 
AGCACTGCCT 
CAGGCAAGGA 
GAAGGTGAGG 
AAATAACATG 
AATGTTTGAT 
GAATGGAGAA 
TCAAANNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ATGATGATAA 



CAAGAATTTT 
TTTTATTTTT 
AGTTTGCTGA 
GTGGAGTGTG 
ATTTTTCTCT 
GATGTAGAAA 
TAGGTATAAA 
ATAACTCTTC 
ATAATAAACA 
TTCAGGAATT 
TAAATTGCTA 
CCCTCTTTCC 
GTTGAGTCAT 
TCTTCAGCCC 
TGCTTTCATC 
CTAAGGAGAA 
NNNNNNNNNN 
AGATATTGGT 
TTTTGATCTC 
ACACTTTGCA 
CCACTTTGCT 
ACCTCTAACA 
ACACCCTCCT 
AGCCAATATA 
TCTAATCCCC 
ATGTTTGATA 
ATAACTTTGT 
GGGTTTTCTT 
TATCCTACCG 
TTCACTGAAT 
ATAGGACTCC 
ATGTGTGCAT 
TAACAAAAAA 
TGAAAGTACT 
CTACAGCCTG 
AAATTTTTAT 
GAGAGAGTTT 
GCACTTAACG 
GTGCCTCAGT 
AGATTTATGG 
GTATTTCTTG 
GAACTTATTT 
AAAGAGTAAT 
TATGAACACA 
TTTTAAGAAT 
TTTTCTTTCT 
AAGGATTTGT 
AGGCCTGGAA 
GAAGGGTTAG 
CACATCAGTT 
AATTATCAGA 
CAGCATGGGT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ATGTATGGAA 



FIGURE 3K 



33001 
33051 
33101 
33151 
33201 
33251 
33301 
33351 
33401 
33451 
33501 
33551 
33601 
33651 
33701 
33751 
33801 
33851 
33901 
33951 
34001 
34051 
34101 
34151 
34201 
34251 
34301 
34351 
34401 
34451 
34501 
34551 
34601 
34651 
34701 
34751 
34801 
34851 
34901 
34951 
35001 
35051 
35101 
35151 
35201 
35251 
35301 
35351 
35401 
35451 
35501 
35551 
35601 
35651 
35701 
35751 
35801 
35851 
35901 
35951 



ACTTCCCATT 
GGAACATAAT 
CACCATTTAC 
CTAAACAGCG 
CTCCCTTCTC 
TGAGGCTTTC 
CACAGATTTC 
GGTTTTGTAT 
TATACAATTA 
TATTTGGTAC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ATTAAACAGT 
GAGAAAGTGG 
GTCCTATTTG 
AGTTGTAATG 
TGAATTGAAG 
TCCTTTCATA 
TAGTTAGAAA 
GAACTGCTAT 
CAGGACATTA 
ATAGTTTTAC 
AAATTTTTTT 
TGTGTGTGTG 
CTTTGGCTAC 
TAGTGCACGA 
AATGCATGCC 
TGGTGAATGT 
GAAAAACCTG 
GAAGACACCT 
TTTCCATGGC 
NNNNNNNNNN 
ATTAAACAAA 
TGAAGAAAAG, 
TGTTTTTCCA 
CAGACCCCAG 
GATTTGAAAT 
TAAATCTCCC 
AAAATGATTA 
TAAAGCAAAT 
ATAATCCATT 
TTATAATNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
CTCGAACTCA 
CAGTTAAAAA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
AAAGAAAGAT 
AAAAGATAAG 
TTAAATGTCA 
ACTTGACCTC 
TCAGTGTCTG 



TCCTAGAGCT 
CATCTGATAC 
CCATCTCTTT 
GCTTACTTGC 
TGTCTACTCA 
TGCACGTATG 
ATATTGCTGA 
TTTTATCTCG 
TTTTTCATCT 
AGAANNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GAAAATATTT 
AACTCAGTTG 
CTATGTTAGG 
AATCTTAGGG 
TTCCAAAACA 
CTTAGGAATA 
TTTATCTGTA 
TTGAAGTATA 
TAAAGTTGTA 
CTAGTTTTAT 
CAGCCCCAAC 
TGTAGCTTAC 
TTTGTTGTAA 
CATCAATGCT 
ATCTTCTTTT 
ATCGGGCCAA 
TAATCAAGGT 
CTATCACTCC 
TATTTTTCCA 
NNNNNNNNNN 
TTGTATCCTC 
AATCCCTGCA 
CTAACTAGTT 
GTACTGCTTC 
TTGGAGTTTC 
TTCAACTACT 
ATTTAGCAAG 
ATTTATTCAA 
TTTATCTTTA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNTTTT 
TGCTACAAAG 
TTTCCATTAT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TAGGGTTAGA 
TCAATTTTTA 
GGTGAAATAA 
ATCTGATATA 
ATACTTATTC 



AAAGTGCATG 
CACTCACCTG 
AGCTTAATGA 
CTGTGAAATA 
TGCTTCAAGA 
TATATGGATT 
TAAATAAATA 
ATTGTAGACT 
ATTATAGTGT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNTAGAG 
GAGTGATGAT 
ATGGAAAAGA 
GCATAGAAAG 
TTAAGAAAGA 
ATGTTGTTTT 
CCATTTCTAG 
TTTCATACAT 
TTTGAGGGAA 
ACTATGAATA 
TTGTCTATTT 
TGATACACAC 
AGAGTGTTTA 
GTTCTAGTTT 
ATATGATTGG 
CACAGCTGTG 
ACAAGTGCAA 
AGGAAAACAG 
CAAATTAAAA 
AAATATGTGA 
NNNNNNNNNN 
TTATTCTCTG 
TCAGTAATTT 
GGAAATCCTT 
CCTTTAATGC 
CTCCTAACTC 
GACCAGTTTG 
CACTTTTTAA 
CACTCATAGT 
CAACAACATT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GGATAAAAAA 
TCAGACAAGG 
TNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TTTCACTAAA 
CTTATTTTTT 
TCATGGTGTC 
GAAATGATAT 
CAAGATACTC 



TTTCTCATTC 
ATTGTTTCTC 
GTCCCCTGTG 
TTCTTCTCTT 
TTCAACATAA 
TGCTTGTGTA 
TTGTTTTGAA 
CCTTGAGACC 
CTGGCATAGG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
AATAAACGGA 
TTACAGAATT 
GTGAATGTCA 
GCCATTATAG 
TCAAGATACA 
GGTTTTTGTT 
TAAAATAAAT 
ACTTAGTACT 
TGAGTTTGAA 
AATTTCAGTT 
GAGTATTGTC 
ACATATACAT 
TAGGAAACTG 
TTTTTCTTTT 
TGTTTCGTTG 
TGCCAACCAC 
GTGTCATCCT 
TCTGACATAA 
ATATTCTTAT 
GCTGCTATTT 
NNNNNTTTTA 
ATACTAATTA 
AGAAGCCTTC 
GGCCAAATAA 
CAAAGTCAAG 
TGAGCCCTTC 
GAATGTTTCC 
AAAAATCAGC 
GCCCCAGGTA 
AGGAAGTAGA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ATATATTTGG 
CTTGTCTTGT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NCATTATTAG 
TAGATAAGTG 
AAATTATACT 
TAACTATCCC 
TGCTGAAATA 
CTTGAGGTAT 



TGAAATGTAG 
ACTCTTCCTC 
TATCTCCCAA 
GGGTAGTCTG 
GCCTCCTCTA 
ATGATTTCTT 
TAAGAAACGT 
AGTACCATGC 
GACATGCACA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ATAAATTCCA 
TTAAAGTCTT 
AACTGAGAAC 
ATTGAAAAGC 
GAAATATCCT 
TCATTTTGAA 
ATTTTATGTT 
TTTGGACTAA 
ATTTTGGGTG 
ATGCTTATGC 
CTTGAATTTA 
ACATAATACA 
ATTTTGTATA 
ATTATTAAAC 
ACCTAGAAAT 
GATGCAAACA 
GGTTATGCTG 
ATACACAATC 
CTCAAACTAC 
TGCTGNNNNN 
TCAAAGGACA 
AAATGTATTT 
TGGTACTCCT 
TTAACCACCT 
TAGGGGATTT 
GATTCGTGAT 
ATGATAAATA 
ATCAGTTGTT 
TTTTATATAT 
GCATTATTTA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
AAATGTGATC 
TAATTAAATT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GAATTAAACT 
TCAAAAATAA 
CCCTATCATC 
TTACATACTT 
CTACTGTTCT 
GTCATGTAAA 
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CTAATGATTT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TGAAAACATA 
ATGAACCATC 
TAGAAGCTGC 
AGAACTGATT 
GAAATTTACC 
ATTATCTCAG 
CATAAGTTTG 
TTAGAATCTC 
GTTTCATTAA 
TATAAAATAT 
GAAATTCATT 
TAAAGACATG 
TCCTATTTTC 
ATACCTATTT 
TCCCCAAAAC 
TGGGATTTTA 
AGTCCAGTTT 
AATCATCTCT 
TGATGGCAAG 
NNNNNNNNNN 
NNNNNNNNNN 
TAGTCCTGAA 
AGCATCTTAG 
CAGGGAAGCT 
CTTCCATAAA 
TTAACTTAAA 
ATATGCTTCA 
GAGAAATGTC 
TTAGCTTTTG 
AATTTTGTAG 
TCATGGTGTA 
GCTGCCTGGG 
GTTTAATTTA 
CTTGATCTTC 
TTAGGAAATT 
GCACACGTTA 
GATTCTTTGA 
TCCCTCCTCC 
CAGNNNNNNN 
GGGGAATGAT 
AAAGACATTC 
AATATTTATA 
ATGTTGAATT 
CCAGATACTA 
TAAGCACAAA 
CAGCCGTAGG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCAACTCCCA 



ATAGACACAG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNGACACC 
CTCAAAATGG 
AGATGAAATC 
TATTCTAAAG 
TTCCCAATGC 
AAACATACCT 
TTTGCTCCTT 
CAGGTAGCAG 
AGTATACTTT 
TATATCAGCA 
AAAGAAGACG 
TAAATTTCTT 
CTAAAAGTAT 
AGAAAACCAG 
CTCTAAAGAA 
CATATTCCTT 
CACCTGCAAA 
TGAAAATAAA 
TATTTTGCCC 
GTTNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
CACTGTGGAA 
AGAGTCTCTG 
GCCATGTGTG 
AGTTGTTAGA 
CTGTTTTCAT 
CAGATAAGGG 
AAAAAGAATT 
ATTAAAAGTA 
CGAGTTATAT 
AACAGCTTGA 
AAACAGCCAG 
GTAGCCTTGC 
CCCACTAAGT 
ACAGGCTTGG 
CCTTTCAAAG 
TCATTTGCTT 
TTGAGTTTCT 
NNNNNNNNNN 
TTTTAGTATG 
TTAGGAATTT 
AATGTCAATT 
CTTTATCATA 
ATTTTGTGGA 
AATCTAAGGG 
TTTTGAAAGA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
AAACCAATGT 



TTTTTTTTTC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GTTTTTACAA 
TGGTTCATGA 
CACACTAGCA 
TGTGTTCTTG 
TTGTTTTCTC 
GCATAATCTT 
AAAAAACTCC 
TGTTCTGGTT 
TTTTAAAGTG 
CGGTAAAATA 
ATTAAAATAA 
TGCAGAAGCT 
TATTGAAACA 
TTTAACATGT 
AAAATCGAAT 
AAAGAGAAGA 
AATAGTGCTG 
ACTACAGCTT 
TCCTCACAAA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GCTTGAAAGG 
CATCCATAGA 
GCCTGGTTGA 
GAAAATTGTT 
TCAGGCCCAC 
AAATCAAACA 
AAATTTGAAA 
GCTTTTGGTA 
ACTTTAAGAA 
ATAAACAAGT 
AATACCAGGG 
CACAGGCTTA 
CATTTTCTGG 
GCTGATCATT 
CAGATGAAAT 
CCTTTACTTT 
GTACCAACAC 
NNNNNNNNNN 
TAAATATATC 
ACAGTGTACT 
CCTAATGTTT 
AGGAAAGAAT 
CTAGTTACAT 
CTTTAACTCT 
TGTATGGTTC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCATTTGATT 



ACTATTTATN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCATCCAGAA 
ACCATCAGAT 
AAAATGACAT 
TATTATCTGC 
TCTAATACTC 
TTTAAATAAG 
ACTTGATTTT 
ATTGGATGCC 
GTGAACATAA 
TTGTTGTCCA 
GTCACTTTTG 
GCATCAACTT 
AAAGTTCATC 
ACTTTTTTTT 
GTATCAAGTT 
GGATTTGATT 
ACACTGAAAA 
AAAATAAACT 
AAAGCAGTGT 
NNNNNNNNNN 
NNNNNNNNNN 
NAGTTTCATC 
GTTCCACCAG 
CTTGAGGGCC 
GTGTGAGAGC 
GATAGTGCCT 
ACATTTTAAT 
GGCTCCTTTT 
TAACTTTACA 
TATGACAGGT 
ATAACCCCCA 
GCTTAACCAG 
CAAGCTGCAT 
GTGTGATCTG 
ATTTGTTACA 
AACATACTGT 
TCTAACCTGA 
TACTTTTATT 
AAACCTCTTT 
NNNNNNNNNN 
ACTGCATATT 
TTATAATAAT 
TAGCATGGTT 
TGGAGTCTTT 
CTGAAAGTTG 
AATATTAAGG 
CNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNGAAG 
TGGACATTTA 



NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
CATTGATTTT 
GAAATTATTC 
GTCCCAATGG 
TATTTGATTT 
TTTTTAACTT 
TGATGCTCTT 
TCTTCCCCAG 
AATGTTCATC 
GCCCTCAGAG 
AATGTGAGGA 
AGATTGCAGA 
TAACATGCTT 
ACTGGGTAGC 
CCTGTAACAC 
AAGATCTTGC 
AGTAGCAAAA 
TGTAATTTCA 
TAGGTGTTTT 
GGCAGGTTCC 
NNNNNNNNNN 
NNNNNNNNNN 
TAATGGTTCT 
TTCTTGGTAC 
CTGTCAGGAT 
TGCCTAGAGA 
ACATAGCAGA 
AAAGTAGAAA 
TTCTGGAGGA 
GAACTGGAAA 
AT TC ACT GAG 
GAAACTTGCA 
TGCCTTTAGA 
TTTGGAACTG 
CTTTTGGTGG 
CCTAGAACTG 
ACTCTACAAG 
ATTCTGGCAA 
TATGCATATT 
TTCCCCCAGC 
NTTTTCCATT 
CTCAGAAATG 
TTCAGAAGAA 
TATGTTTCAT 
TAGGTCAGAA 
ACTGCTTTGC 
TGGTTACCTA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TTGTATGCTT 
ACTGGCTATT 
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AATAACATAA 
TTCTTTGAGT 
GTTAACATTT 
TTCTTACATT 
TAAAATCTTT 
AACTACTATT 
CGTTTACAAT 
TTGTATAATT 
CTTTTAAAAA 
CGATGGATTG 
ACAGTGATAC 
CAGAAACTAT 
AACTGTATGT 
ATTGACATCA 
ACGAGCACAT 
CAACCCCTGG 
ATTGTCCCAG 
GTCACTTCTT 
AGCCCTGAGC 
TGTGCTCTGA 
CATTGCTTTT 
ATAGCAAAAC 
AACTGAAATT 
AGTCCTCTTC 
AAAGCATACA 
AGCCTAAGTT 
CAAGTCATTA 
GCCTGTGGGG 
GTCATGCTGG 
GTTCCTCTTT 
TTAGAATAAC 
GAGAAGAAAT 
AAAGAATTAA 
TGACATATTG 
AGCGGGGAGA 
AAAATCCAGT 
TCTGCATGCT 
GCTGTGCAGC 
TCTCCCATCC 
CAATAGTAGA 
AATGTAAGTG 
ATGAAATAAT 
ATATATATTA 
AAAAACACCA 
ATTTGAGACA 
GGAGTATTTA 
ATTGAATGAT 
GCAGCACATA 
GGTAACATAA 
GCTTCTCACC 
TAGGAAAAGT 
AATCCTATTT 
TTTTTAGGAG 
AGTGATATTA 
CAGCTGGTAG 
ATATGCCCGT 
ACCCATTGAA 
GATTTTTTTT 
TTCATGATTC 
ATTTCGGTCG 



TGGTGTTGAA 
CTCACCCATC 
TTTTGCATAA 
TAAATGATAA 
ATGTAAATCA 
GTACAACGTG 
TCAGTGTCCT 
TGTTTTTTGT 
TGATGATCTC 
TTCATTTATA 
ATTTTTAAGA 
TGTTTATAAA 
AATTGTTTAA 
GTTCAAAGTT 
CCCAGCTCCT 
ATCACCAAGC 
GTGGTCTGCT 
GCAAATGTTC 
ATGAGCAAGC 
AAAGCATGGA 
CTTTCTCTTT 
AACTTTTTTC 
GGGGGTTTGA 
CCCACTCTTG 
TTAGTGACCT 
CTGGTTCCAA 
ACCAATCAGT 
TTGGCGTGAG 
AAAGCTTTGT 
TAGGTCATTT 
TGAATTAAAA 
TTGATGCATT 
TTTCGGTCCN 
ACATCTGTTT 
AAGTCCAAAT 
AAGTTAGTTA 
AATAAAATTT 
TTGCATTTCC 
AAGCTCCTGA 
TGCACTTTAA 
GTTTATTGTA 
TCATTAAATT 
TAAACCTCCA 
AAGTTTTATT 
GTATGATCAA 
CTATAATTTG 
CTCCAATAGA 
TTAACTTACT 
TTGAAGACAA 
AGGACACCAG 
CTGATATTTG 
TATAGTATCT 
ACACCTGCCT 
ACACCAGTCA 
AGTACAGCTT 
TAATAACAGG 
TGTTGCATGT 
TGGACTCATG 
TTAACCCCAA 
AAGCCATTCT 



ACAGAACGTG 
TTATATTTTG 
AGCTTCTCAT 
TATTGACGAT 
TACATTTGTG 
ATAGTGTAGG 
TAAATAGAGC 
TTTCTTCCTT 
TTGTTTGGAA 
ATGTGTGTGT 
AAATTAAAGG 
GTATCTAATG 
TATATACTTG 
GACTCTTATT 
CTTGACCAAG 
CACAAGTTTG 
CTTCCCACCT 
TGAAGACTAG 
AGCAGGAATT 
GCCCTACTGT 
TCTGAATTTT 
ATATGTAGGC 
TGCCTTTTGC 
AGGTATAGAG 
TCTTTTGGTA 
GAGAGCACTT 
AACAGGGAGA 
AATAGAAACA 
AGGAAAATAA 
TATCTCTAAC 
TAGTTGAAAT 
TGTATATTTT 
NNNNNNNNNN 
GCCTATGACA 
GTGTTAAAAC 
TCTCCCGATT 
CTGATTTTTT 
ATTCTGGACC 
ATCCACCAAC 
GACTTGTATA 
AAGTTCTATC 
TTTTATTTTC 
GAGAAAAAGG 
TCTAGGTTCT 
TGACTTCATT 
GAAGTAATTT 
TTGTGATAAT 
TAATATGGCA 
GAAACTCTTA 
ATGCATTTAA 
GCAAATTTTG 
GAAGTGGCTT 
GTGGCAGATG 
GCTGTCAGGG 
TCTCAATTGC 
TAAAATAGCC 
TTTCTTCCTT 
ACAGCACATT 
AGTGCTTTTG 
CTAAAACCAC 



ATTGTTAAGT 
GTAAGCTAAT 
GTATGTTCCA 
CTATTATGCT 
CATAATTTTG 
GAAAAAAATC 
AGGTGGATTA 
CTGTGTATGA 
TTTTTTAAAA 
TGTCCTTGGG 
AAAAGAGAAA 
CATTCTTTTT 
CAAAGATTTT 
ACCTTCTTCC 
GCAGTGAACA 
CCTTCAAGGG 
GCATGGCTTC 
CCCAGGAGTG 
GCCACCTGGT 
CCTTTTAATG 
GAAATATGTT 
AAATTCAGGA 
CAAGGAGATT 
TTTTAGGTTA 
TACATTGTGT 
AGTTGCTATT 
CAGTAATTCC 
CCAAAAATAT 
TGTGGTAGCA 
CCATTCATAG 
TATTAATCAT 
CTACACAAGG 
NNNNNNNNNN 
TTGCCCGATT 
AAATGAAAGT 
TTTCAAGCTA 
TCCTGAAGTT 
■ AGTTCCCTTC 
CTACTGAAAT 
CATAATAACT 
CTTTAGTGTA 
CTATTTTATA 
AAGATAAGTA 
TTTATCAACT 
TTGTTCTGCT 
ATTTTTGAAT 
GAACACAGCA 
CTAGGTCAAT 
AGAAAACTGA 
TCTTTTGAAG 
ATAAACATGG 
TCATAAGGGT 
GGACAATGAT 
AATATCATCC 
TTTCCATGTT 
AGTACATCAT 
TCACATATTC 
GGCTTTTCTT 
CCATGGGAAC 
CAGCAGCTCC 



CTCAACTCTC 
AATGGCAACA 
GAAGCATAAT 
GTTATTTAAA 
GAGTATTTTA 
TTAAGTTGGT 
TAATCAATAT 
CTAAATAATA 
AATTGTATGA 
AAACAAGATG 
CCAGAAGAAC 
AAACTCTACC 
TAGGGCTAAA 
TGGTGTGAAG 
GCCTCTTTTC 
GTGAGCGTGC 
CTGCAGTTAT 
TCCAGCCTTC 
GCTGATTCTC 
CATCGAGTAT 
CTAAAAACAG 
GTTCCAAAGC 
TTAGAAATCA 
TCTGAAAATC 
AACATCAGAT 
TGATTATAAG 
TGCCCTATTT 
TAATGTCTTC 
TAATTTTACA 
GTTTAAGAAC 
TGTCTCTGCT 
AATTAGGGCA 
NNNNNGTAGT 
TAAGCACCAA 
TTATTTTTAA 
CTTTTCAGTG 
TAATAATAAT 
TCCCTCTCCT 
GTATTCCTGA 
GAAGCATTTG 
AGATAAGCAC 
GATTTTCCTT 
AATTTAAAAT 
TTTAAGATTT 
TATTATTGTA 
TTATTGCTTA 
TTTATAGAAA 
GAGAAAAGAA 
GGACAAAAAG 
CTCTGTACTT 
ATGACTATGG 
CATTGTGAAG 
GGCAGTCACT 
AGACCATCAG 
TTGGATACTT 
TTCCACATTT 
ATACAGTCCA 
TCCTTTCAGT 
GGAAGGATAA 
AGTCCAGATT 



FIGURE 3N 



4 2001 ATGAATACTA CCATAGCAAT CCTAAGGGGC CCTTTATTGG CCCAGACTGG 

42051 AGCCAAATTG TAGAAGAGCT GCCCCAACTG GGCTTCAGGT TTAGTTCAGG 

4 2101 GACATAGTGT CTGAAGAAAT TTCCATCAGC ATAAATACTC CTTGTTTATG 

4 2151 AGCTGCCTGA AACTGTAAAC ACCGAATCCA TTCCCATCAG GAACTCACAA 

4 2201 AAGTTTCTGT TATGCTTTGA AAATAAACCT AGGGATACTT AACTGACATG 

4 2251 TAAAGAAAAA TCCATCAGTA TCCTTTTCTC GAGATATAGG TTTTGATTTT 

4 2 301 ACTGTGTTAT GTTGTGGTTT TGTTCTGTTT TTGTTTAGTG CAGAAAAATA 

4 2 351 CTTAAAACAC AAAGCCTTTC TCTAACACCA TATTTGGTTT TAAATGCTAT 

424 01 TTGCTATAAT ATCAAGAAGA TTTATCAAAG ACAGGTGTGA CTCTGCAGGA 

424 51 CCATATTAGG AACAGTCATT AACTCCTCTT AGAAGATGAG AAAAGTTTCC 

4 2 501 CTCTCATTTT TAGTAAATTC TGAAATACAA AGTAGAAGAA TATGGGAAGG 

4 2 551 GTAGAAAATT TCTCAGTCAT TCTTCCTTGA TTTTTGCTTC AATTAACAGA 

4 2 601 GGGAGAAAAC ATTTTTGATA CTGTAATCTG CATGGCGCTT TCTGTGAGGA 

4 2 651 AATTTTTTTT GAAATATTTA AAATGTGATT TTGTTTTAAA AACCCTTAAC 

4 2701 TAGTCAAGAG ACAATAAAGT AGAAAATTGA AGAACGCATA GGAGATGACA 

4 27 51 AAAGTATTTG GTTTATTTAT TTAACCTTTC CGGGATTTGC CCAGCTCGGG 

4 2801 GTCTGCTTAC AGTTGCATTC AGGGCAGATA AAAGACCTAC TTGGAAAATC 

4 2851 AGTAAGATAT GAAAATTTTT TTAGATGTAA TAATTATTTG GTGGTTTTAA 

4 2 901 AACATAATGG AACTTGATGA TTTAGTTAAA TGAAATATAC AGATTTCTAA 

4 2 951 TAAATGAGCA GATCAGATTG AAATAGATTA GATTTGAAAA CAATTTTTGT 

4 3001 CACTAGACAT ATTATCTATA TTTTTTATTC AATGACATGG ATTAATAGAT 

4 3051 AATTAAATTA TGCTAAATGG GTACTTATGC TGGTGGTTCC AGACAAACAT 

U 4 3101 GTGGTCACAT TTCTTTTTAA TGAAGCATAT ATTTTGGGCA TTACTATTTC 

J] 4 3151 AATGTTTATT TGTCATTATG AGTTTTATGT TCACTTGAGG CAGTGTTTAA 

E[ |j 4 3201 TTTAGGTTAA AACCTTTTCA TAATGTAAAA TTTGTTGATT CATTTCATAT 

' 4 3251 TGATACTTAA ACCAATAAAC CTGAAATACA TGACAGTCTA TTACATTTTG 

.,]? 4 3301 TCCATTAGTG TTTCCATATG CCTTTTTTAA CTGCGGAATA AATGAAAATA 

4 3351 TGTTGGTAGG TTTTTTAAAG TTCATTTTGT GACTTGAATG CATTACTAAA 

M 4 34 01 GTAGGAAACT GAAGTTTCTT CTTTAATCTG AAATCATATA AAATCTACCA 

rfj 4 34 51 GATTGAACAA GAAAAACTAC AATATTGATC ACTCTTAATT TATTTCGTTC 

£i- 4 3501 TTATCCTCTG AAAATATAGC CTATGTAGCC TTCCTTTGGG ATGGGAAGGT 

43551 CAAAGCAGTC AAATGTTTTA AAATCTGTAT ACTTCCTGTA AAATAAAATA 

J s 4 3 601 TTCTCAAAGT CCAAGCCTTA AGGAGGCCAA TGCCTTATCC ATTTAAAGTA 

h'? 4 3 651 AAATATTCTC ATGTAATATG TTCTTTAAAC AGAGAAAGGA AGACATTGAA 

III] 4 37 01 CGAATAGGGT TACATTGTCA GGATTGGATC TAGCAATAGA ACCCAAATAT 

r, 4 37 51 TTTGAGAATA TTGGCAAACA GTTGTTAGCT GATAGAGCAT CTCTGCCCAT 

4 3801 GAGGACGTTA GTATGCTGCT GTTTCTAGGG TTGATAAGGT AGGCACCTGT 

]]\ 4 3851 TGTTGAGAAT TTTCCTAACT GGCCTTTATA AACAGAGATC CATAAAGGTC 

Q 4 3 901 ATGGCGACTC CTGGTTGCCC CTAGTGTAAC GTACCTTCAA ATTAATATTG 

U 4 3 951 CATGAACAGG ATCTATCTCT TATGTAAATG ACAGTTACTG TAGCTTGACC 

4 4 001 TATTTTTCTC TTTCATTTTT TAATATAGAA AAGAATTTTA GATGGCTTAC 

4 4 051 ATTAGTGCTG AGGTTTTGTA ATTTTTCTGC TTCAACCCAC ATTGTTTTCA 

4 4101 GATCTAAATG AGTGTGGCCT GAAGCCCCGG CCCTGTAAGC ACAGGTGCAT 

4 4151 GAACACTTAC GGCAGCTACA AGTGCTACTG TCTCAACGGA TATATGCTCA 

4 4 201 TGCCGGATGG TTCCTGCTCA AGTATGTCAA GAATCTTAAC TGTTTTATAA 

4 4 251 GTGCTTTGGG CTTGTTTCTG TTGTGCTCTG AGAGCTTGCT TTTGTGAAAA 

4 4 301 TGGCCTCCGG GGTTCTCCTA AACAAGATGT GTGCGTGTGT TTATACTTTT 

4 4 351 GCCTGAGGAA TTGAAAATCA AAATAAGAGG CACCATTTTC GATGTATAAT 

4 4 401 ACTCTTCTAT CACGGGTGCC AATATTAAAT TGATTAGGAA ATGGTGTTTA 

4 44 51 GAATAAGAAT TACAAAACTA AGTATGTTTT CTAATTTTTT TTTTAAACAT 

4 4 501 GTTTACCCAT GTCTTCTATC TGGGCATAGA AGAATGATTC CAGGCTAAAA 

4 4 551 AAAAAAAAAA AAGAACACTA CTGACTATTT AGGGACTGCT CAGACAGAAT 

4 4 601 CCATCAGGTC TGTTTTGTAA TTTTAAATCA TTCATAAATA TTTTCTTCAT 

4 4 651 TTTTATATTT CATTATAAAA GCCTTTAGGC TTTTATAGAA TTTTAGACTC 

4 4 701 TAATAATAGT CTAAGACTTC TAAAAAACAA TTTCAAATAA AAAATAAATG 

4 47 51 AATATGATAC CAGAAAAGTA GGATTCATGT TTATAAGGAA GAAATGCATG 

4 4 801 TTTTCACCAT CATTTGGATG TAAAAAATGG ACTTTGCCCT AAAATTCTCT 

4 4 851 ATAGGCTGAC TCCTGAATTG TGGTATTCAT ATGATATCTG ACAATTAATG 

4 4 901 ACTTGATTTT TATTTTTACT GAATTATCAT TTAACTGAGT TGCATAAAAT 

4 4 951 AAAGTTAGCT AATTTTATTT AACATTGCAT AAGTATAGCA TGGAACTTTT 
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GTTAATTATG 
GTGTTTTTCT 
NNNNNNNNNN 
GAGGTTGTGA 
GAGAAAAGAC 
TATAACAAGA 
TGGATGTCAG 
TCAAACTCAA 
TACTCTCAGC 
TAAGGTCTAA 
CAATATGTAA 
TGACCTTTTC 
TCCCCTGGAA 
TAGCATTTTT 
ACACACTGAA 
GAACGTATCA 
TCTCACAGCA 
ATTTAAGGCA 
TTAAAGAAAA 
AATGTGAGCA 
GTCTATAGAG 
TGGGGCATAT 
GTATAAACAA 
CAAAAGAAAG 
GTTAGATATG 
AAATTTTCAT 
GGTCTTTGTT 
AAGTCCCTTC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
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NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ATATTTAACC 
CAGATACTCA 
ACTCCATGTA 
CCCNNNNNNN 
CCCACTTAGC 
AATCTGTCAC 
CAATACTGAT 
ATTTCTCTTT 
GATGTTTTTT 
NNNNNNNNNN 
GTAATCCTTC 
TTCCTTCCCC 
TCCTGATACA 
AGTCTAATAT 
GCATAGAAAA 
TTTTAACCAC 
GATTTCTACC 
NNNNNNNNNN 
NNNNNNNNNN 
TAGCTTCAAA 
ACTATGAAAT 
TTGATGGGGA 
GTGAAATATT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TTTTTTTAGG 
AATCCGGTGG 
AAATTCTTGT 
TTTTATAGAG 
AAGTTCTTAG 
ATAGGTTTCT 
TTGCAAATCT 
AACCATGGGA 
TAGGTCAGGC 
CCCAGTATCC 
GGGTTGATCT 
GCACAAACTT 
ACACCATAGG 
AAAACTCTGG 
GAAAAACTTT 
TCTGTTCTAG 
TATGGAAAAT 
ACCTGAAAAG 
AATACATGTC 
GACCAAAATA 
TTTCATTGTC 
TGTACTATCG 
AAGGGGCAGA 
CTGATTCTTA 
TTCATTGAAG 



NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TAAATTTATG 
GGGTTAATGG 
CTCAAAGACA 
NNNNNNNNNN 
AATCTTGTTG 
AAATGATCGA 
CACGGGGGGG 
GTTGCAAAGT 
CCTCAAANNN 
NNNTGGAATT 
ATATGCTTAA 
AAATAATCAA 
GCTTAATATA 
ACTTCTAAAT 
GGAATATGAA 
TACTAAAGAT 
TCTATTTATG 
NNNNNNNNNN 
NNNNNNNNNN 
GATCAGAGAT 
GTAAAAGATT 
GAACTTATGT 
AAACATATTT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TTATATGACT 
GCATGATTTG 
TGTTTCATTA 
CATTATTTCT 
GTCAAATACA 
TAACTGTAGG 
CTAAGAGAGG 
TTATTTGAAT 
TGGGCTAAAC 
CTCACTAGTT 
TAGATTTAGG 
AGCCAGAAAA 
TTTGCTGCAG 
TGTCACAGGG 
CTGCAGAACC 
CAATATATAA 
ATTTAATAAT 
TAGAAAGCTG 
TGCATATGTC 
GTAACCATGA 
TTTTTGTTGT 
TTGATAACAG 
GTAACAGAAC 
ATCATGTTTA 
GTTTTATTAA 



NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNTCATCCCA 
AAACTACAGT 
ACACAGTTGT 
CTTTCAGGAG 
NNNNNTTTTC 
ACTGGTCCTT 
GTCAAAGATA 
TTTCTTTTAA 
AGAGATTTAT 
NNNNNNNNNN 
TTATATTGGA 
CTAGCTAAAT 
AGTTAATTCA 
GCCATTGAAA 
TTTAAGCATG 
AATAGCATTG 
AGAATATACT 
AGGTACATGA 
NNNNNNNNNN 
NNNNNNTCCC 
AAGGGATCTA 
AATGTGTTTT 
GATTTAAGGT 
TTAAAATAAN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
AAAAACAATT 
AGAAATATAA 
AGCAACACTA 
ATGAGACAGA 
CTGAAATATT 
TTTAAGAACT 
ACTTCTCAAT 
CATATAATGT 
CTTGGGATTA 
TAGGAAGTAT 
AGCTCTCGTT 
AATGAGGGTT 
GCAGCCATAA 
GGGATTGTTA 
GTAGGAGCAA 
CAAAGTAAAG 
TGAAATCTTA 
TTTGGAAAAT 
TACCTGCCAT 
TGGATGTGTG 
GTATATCCTG 
CTTTATCCAG 
TAATTCATTA 
CCCACTCTTC 
TTTTATTTTG 
CTTCTATTTT 



NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
CCACTGGAGC 
TTATGCATTT 
TCATTGTCTC 
ATGACTCACT 
TTAGAAATGC 
TGAACATTCT 
TATAATAGTC 
CATGTTTTGT 
GGGCAGCTAT 
NNNNNNNNNN 
TTTAATTTTT 
TTTACTTGTT 
GATTATGGTT 
ATATTCTCTA 
ACACTGAAAA 
AAACATTTGG 
GAACCAGAAA 
CTTTGGACTA 
NNNNNNNNNN 
AGCTATCCTT 
GTGTTTGTGA 
GTTTATCACT 
TGCAACTTTT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TTAGCTTGAG 
TAAGATGTAT 
AAGTTATCTG 
CTTGTGCTTC 
ACACAGTTGT 
CCTAGGTTTA 
ACTCCAATGT 
ATCTTTTCCC 
GAGTTTACTG 
GGAGACCATA 
GCCTGGTTTT 
GGGGAGGGCA 
GTGGTGGTAT 
AAAAGCCACT 
TCCCAGGGCA 
GAAGTTAGGG 
TATAACTATT 
ACCCATGTTA 
TTGACTCCAA 
TAAAAGTCTG 
TTTATGATAT 
TAATTTTTAA 
TATCATTAAT 
TCAGCAGGAA 
GACATACTGA 
TAACCTTTCA 



NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ATTTTTCCTT 
TTAATATTGT 
CAGATCCCAT 
CTTCCCTGGG 
TTTTTTTCAA 
TTATTTTCCA 
AAATCTGACC 
AATCTAAATC 
TTAATCTCCA 
NNNNNNNNNN 
TTCATTCTAA 
GGTTCTGATA 
TTTTTTCTTA 
GAAAATTCTA 
ATAGTGAAAA 
AATTAAGGGG 
ACCTAGTCTT 
GTATGAGTNN 
NNNNNNNNNN 
TCTCACTGAG 
ATTCTGAGTT 
GTAAATAACC 
TTTGTGTGAT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GGACCCTAGG 
TTACAGGGCT 
ATGTAGCGTA 
TCAAAGTGTG 
ATGTGTTAAA 
GCTAAGTTAA 
GCTAACAATA 
AAGTTGTTTT 
GAATCAACTT 
CTCCACAAAA 
TGAGACTAAT 
GAAAGGTGCT 
GCCTGTATGG 
TTTCAAAGAC 
AAGAAACTCA 
GGACATGCTT 
AAACATGTTT 
CAAAAATGTT 
TTAGATATAA 
AAAGAAAAAA 
TTTGGGCGAT 
ACTGTTCAAC 
TCCTTCAGAG 
AGTGAGGATT 
GTTTTAAAAA 
GATTTTCCTT 



FIGURE 3S 



57 001 TACAATTGAA TGTGATAGAT ATGTCCAATT TTAGCTAGGA AATTTGTATT 
57051 ACGAGAGGGA ATATAACATC ACTTGGACCA GGAAACAGAT TTTTTTTCCC 
57101 CAGTGTTGTC AGAAGCCCAT TTTAACTAAA ATAACATAGG ATGTTTATTA 
57151 ACTAAAATAA CATAGGATGA TTATCTTAAG ATGTAGATTC CTAATCCTTA 
57201 TGACACATTT ACTAAATCAG ACTTACTAGG GAAAGTATCT CTCAGTCTTT 
57251 ATCNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
57301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
57351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
57401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
57 4 51 NNNNNNTCTG AGTAGGCACA CTTAGTCCCC AACACTCAAC CTTTTATTCA 
57 501 ACCAGCGGTG AGCTGGATGT GAACATGACA GACCCAGTAG GGTTCCAATG 
57551 CCTGACAACC TGCACCTGTC AGGAAGAGCC CCCTCCTTTC CTGCTCCCCT 
57601 GCAACACATG GTTANNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
57 651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
57701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
57751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
57801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
57 851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN ATAGCAGGGC TCTTTCTAAA 
57 901 GCATAATGGC AGCTCCAACC CAGCAGAACC ATATTCTGGA TGGAGTCAGA 

57 951 ACTAGCCAAA TGTAGGCTAT TTAAAATATA GCAATTGTGC CTAGGCATGT 
58001 GTCCACCTGA CTCATGTAAA ATAAAAAGAA AAGGCGTTAA TAGAAGCCAG 
58051 ATTATAGAAC ATAATGTTAT TAATATGTAT CAGGTGCTCT TATGAGTGGC 
58101 CATATCAGAG AACCAGCCTC ATTGTTGCTG TTATTACGCA ACATGAGACT 
58151 GTGCTGCAAT TCCAAGTACC AAGCAGTAAG AAGAAAATTG TTTTCTTTTA 
58201 CTGATTGCTG CATGTTGTCC TGAGGTTTTT CCCCTCATCT CTCTTCTACA 
582 51 GATTTTACAG AGTCTGTGGC TAGTTCAGTC ACATTGTTCT TAGTCATGGA 
58301 AATATTATGG TCTTTCTGAT TCTGTGTAGT GATAAGTAAA AAGATTGTTT 
58351 CTGCTGAGGG GTGAAAAGGT CTTCAAAGTA GTTTGCTTTC TTGTAAACAG 
584 01 TAAGACCTGC AGAGACCCTG AGAAGATGCC TGATATCTCC TTGAAAATTA 
58451 AATTTCTGCT AGTGTTTTGA AGGAGCGAAT TGTCACTTCT CACAGGTTAG 
58501 GATCTGCTGC TGTGAATTCT GAAAGTTTTC AAGATTTTTG ATTTATATTT 
58551 TAATTAGATC CTATGGTAGC TAGTAAAGAA AATATCATTC TCTCCAAAAC 

58 601 GTATAAATGG GCCCTTGGTG ATTGTGACCA AAGTCAGAAG TCAGATTTTT 
58 651 TTTTGAAGTT ACATGCACAT TTGTCAATAG AGTTATAATT TACAAGTATT 
58701 GTCCTCANNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
58751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
58801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
.58851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNCTAC 
58 901 TATTGATGTT GAATTCATTT ATATAGTAAT GATAACATTT CCTACTTAAT 
58 951 TCATAAAAAG ACAGCCTATG CTGTTTTCTT GTTCTGAGTT TATATGTTTC 
59001 TCATGCTTTT TATTATGGTT CATTACAATT TTAATGTTAT TTTTAACTAA 
59051 GTAGATCCTT TTGAAACAAA TTGGTTTGCA AGTGTGAGCT GTTAGGTGCA 
59101 CAGAGAAAAA TGAAAATAGA AACTTGCGAT TTTATTCTAG GCTTGTTACC 
59151 AAATATTTAG AATACTGTGT TTTATTTAGG TGTTTATAGT CTCATTAGAC 
59201 AGTTGTGATT TTAAAATAGA GACCACATCA TCTCAACTTC TTTACTGTGA 
59251 AAATAATGAC AATAGTCTTT TCAGAGATGA ATCTGTCTAG ATGGGAAATT 
59301 TACATGATTG ATCTGATGAG NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
59351 NNNNNNNNNN NNNNNNNNNN NNNNNNNAAA ATAATCTGAC AAGTAGTTTC 
594 01 CCCAGAAAAT CTGATTTAGT AAATGTACCA AAAGGATTTA GAAATCTACA 
594 51 TCATAAATAA ACATTCTATG TTATTTTAGT TCAGACCCTA TTTTAATTCA 
59501 GACTTCCTAT GGGATAAAAA CTTCCATTCT TTCTTTAAAT AGATTCTTTT 
59551 GGCTTGAGTG CATTTACACC TGTCCCCAAC AGCTGGTGGG CTTCTGCTCA 
59601 CCCTAGACGG TGTTCATGCT GCACTCAGTC CAAGCAGCCC TTATCAGAGA 
59651 GTCTTCTTAC CACTTGCATT CTGGTGCGAA GGACTCATTC CTGGCAGAGC 
597 01 CTAACCTTCA TGGGAACCAT TGGCTTAGAA GAGAGGAGAA GNNNNNNNNN 
59751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
59801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
59851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
59901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
59951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNTATCC 
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TGTCTCCGGG 
TACAATACAT 
TGATTAGCAA 
CCAATTTTGC 
ATTCTTCCTA 
GTCCAACAAC 
TTCTCTAACA 
TTTGCTTTTC 
TGTNNNNNNN 
TCATAAAACT 
GCCTCTAGGT 
CATATGTACC 
TTAAGACAGT 
TGGCCTGGGT 
AGAATTTGCA 
TACTCTTGGC 
ATTTACTTCT 
TTGACTATTT 
TCCCTGGGGA 
ACAAGAACTT 
GGCCCAGTAG 
GCCTTTATCC 
TAGTTTATGC 
TGGGTAAAAC 
AATATATGAA 
GACACCCACT 
ATGGTCATTT 
TCAGAGGCCG 
CTTTAAAAGC 
CCTTCCTCCT 
CAATTCTGAT 
CATAGTACTT 
TTATGACGTG 
CTCCCGTACT 
GGAAATTGTT 
NNNNNNNNNN 
NNNNNNNNNN 
GAAGGAGGCA 
CTTCTCTGAT 
AATTAATGGT 
AATATATATA 
GGGGAAAGAA 
NNNNNNNNNN 
CTCCCTTGAA 
TTCATAGCCT 
AGTAACCAAA 
TAACAGAACC 
TTTATATTTT 
TGAACCAATA 
CAATATTTAG 
TTTTTTTGGT 
GCTTGAATGA 
CTCTAAAGTG 
GGGATGCTCT 
TTAAATAATT 
CAATAGCTTC 
GTGGATTGTA 
TCTTAGAAAC 
NNNCATCTGA 
ATCTTCTAAC 



CAGTGTCTCT 
AAAAGAGAAA 
TGTCTATGCT 
TCTTTACTGT 
TGGGATGAAA 
CCTTTTCTTT 
TGGTTAAAGA 
TCTCTGATCA 
NNNNNNNNNN 
CTTGTCATTC 
TTTGCCACCA 
ATTCTCTTCA 
GTTATTTTTA 
AATGCCATAG 
AACCTACCTT 
AAGAAGTTGA 
CTGTGCTTAC 
GGGAAACTTC 
AAATACATAT 
TCCTGCTCCT 
CCATTAGAAA 
AGCTGTCACT 
AGAAGGCTGT 
ATGGACCAAT 
CATTCATGGT 
CTCACCTTCT 
CAGCATGCAA 
CCAGTTTCCA 
TCCTGCACAC 
GCACCTTCTT 
GTTTCAGTCC 
CTTCTTATGT 
TATCCTCCCC 
CACTAGCATC 
AAAAATCNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GGATGCATGG 
TCTTTTTTAG 
TCAACTGCAG 
TTGTGTATTT 
CAATTAGAAA 
NNNNNNNTTT 
AT C AAAG AAA 
TGCGTTCAAC 
ATACAAATAA 
CTCTTTTATT 
TTCTTTTTAC 
CAAATGAAAA 
CATATTGGCA 
TTGTGTTTCT 
TTTGTTCTGC 
TATCGGGGGT 
TAGTAATCCC 
TGGTATGGTT 
TGAGCACTTC 
ATGAAGGTAT 
CAAGAACCNN 
CATTTTACAA 
TNNNNNNNNN 



GAGTCTTTCT 
TAGGTATGTC 
CCACCTTTCA 
CAGCCCAAAG 
GCTTTGACAA 
TGGCTTATAA 
AATTTCAGAA 
CTTAAAATTG 
NNNNNNNNNN 
ATCCCTACGC 
GAAACAGCCT 
ATGAACTAAA 
ATCATTTAGT 
GAACCACCTT 
TACTAAATAA 
GCAGTGGGAG 
TGTTAATGTT 
ATTTGTGAAC 
TTAGTAGAAA 
TTTTGTGGCT 
GTACTGTGCT 
TGCTTCAGTA 
AAGAAGCAAC 
GGTATAAAGA 
TGATGTTAAT 
TCCTGTTCAT 
ACCTGATCTT 
CCAGGGATAA 
TCTGCTTCTA 
GCTGCCTCCC 
CCTCTCTTTT 
CATAGAGAAA 
AGCCTAGCTT 
CTCTGCTTCT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ACAGATGGTG 
TTTCTAAATT 
AGAATAGCAG 
AAGTTGAATA 
AATCATGTAC 
AATTAACATT 
AGTGTTTGAA 
AAGTATCAAG 
CAGACATATA 
TAGATGTCTT 
AATCCTAAAA 
AAAAAATTTC 
TATTGGTGTA 
TGTACATGAT 
TACTTATTTC 
GGGTAGTTTT 
ATACTGGTAT 
ATAGAGTAGA 
TGGTCTCTAA 
ACATGATCAT 
NNNNNNNNNN 
GTTGTTACTT 
NNNNNNNNNN 



AAAACTTTAT 
ACAAAAATTG 
CCAGAGATTT 
GTGTCTGTGG 
ACAAGGTCTT 
TGAATAGATG 
ACTAATTTCC 
TGATGCATAT 
NNNNNNNNNN 
TCCAATCTGT 
TCAGAATATT 
CTCCTAAGCA 
CCAACAACAA 
GTTTCCTATG 
TGCTTACATT 
GTGTATATGA 
TTCTGGGTAA 
ACAGGTTATA 
GTGCCTGGCC 
GATTTTACCC 
CCTCAGCCCA 
CCTGTACATT 
AGGCAAGACA 
TTCTGGGGAC 
TTTAGTCATA 
TCTCCAAGCT 
GGCAATCCTC 
CGTCCAAAAT 
TCTCCAATTC 
TCCCCCTGTA 
CATTCTCCAG 
GCATCACTTC 
AGCCAGTCCC 
TTAAAGCACA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNAATGAGT 
GAGGAATTGT 
TATACCATTC 
GTATGCATTT 
TTAGCCCAAA 
CAATGTCTAT 
TTCTGCATTG 
TTTCACAAAC 
CCTATTCTAA 
ATTGCTATTT 
AGGTAAGTTT 
ACATAGAATT 
ATTTCTAATA 
TTTATCCCTC 
TGTGATAATA 
ACTTAATATT 
TTTTCTCTTG 
GTGTGAGGAA 
GCACACAAAA 
ATTCCTTGGA 
CTGCTGCTAA 
NNNNNNNNNN 
GACTAATTCT 
NNNNNNNNNN 



TAAAAGTTCC 
GGCTCATTGT 
AACTTTTTTG 
AACTTTGTAG 
ATTTCTGTGA 
GCTAAAACTT 
AGCCCTTTTA 
GTCCTGCATC 
NNNNNNCATC 
GTCTCTAAAA 
GGAATAATTA 
TAGAAGTAAT 
ATGCCAGAAC 
AAACAGGAAG 
TTGCTATAGT 
TAATATTTAC 
AAATATGCAA 
TAT GAT C AC A 
AGCATTTCTT 
CTGACTCCAG 
GTCCAACCAT 
CCCTTTCTTT 
TATCTTTTGG 
ATTTTCTAAA 
ATTCCCCTAT 
ACTACCAGTA 
TGTTAATCCT 
ATTTACAAGG 
CATTTGATGC 
AGGTCTACAG 
AGCCATAGCT 
TCAGAGATTT 
TCACTCCCCA 
CATTACCTGG 
NNNNNNNNNN 
NNNNNNNNNN 
GGATGGAGTA 
ATTGCTAGCA 
TAATTCTAAG 
GTTTTTATGA 
GTATCAAGCA 
TCCTGGGNNN 
ATCAAAATAG 
ATATTTCAAT 
GTTCTCTTTG 
ATATGTGATC 
TTTATTCATA 
ACAAAAAAAG 
CTTAGATTAC 
TTTAGTCCTT 
ACATAATACT 
TTCACATAAA 
AAACTTCAGT 
GGAAAATTAG 
TTGTAAGAAC 
ATGTTTCCCA 
ATTAAATGGT 
NNNNNNNNNN 
TTGTGTTGCC 
NNNNNNNNNN 
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63001 NNNNNNNNNN NMNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
63051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
63101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
63151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
63201 NNNNNNNACA GGCTCCCGGT GCCAGGAAAC CTTTACATCA ACACTCGATT 
63251 TGCCATTTGA TAGTCCTTCA TCTGGGAGGA AAAAAAAAGA CGGAGGGGAG 
63301 CTTGAAAAAC TGTCATAATG TCCCTGGAAT ATGGTACTTT TAAGAGTTGA 
63351 GCCTATTCCA TTTTGGAGAT GATTTATATA AGTTACAACA AAAGAAGGGG 
634 01 ACAAAAACAT GATTGTTCTA TGGAGTTTTT ATAACTTTCT GTCACAAGAA 
634 51 AGCACGCTTG TCTACAATTT TGTAATATTT CTAGTAAATA AAAGAGGCAC 
63501 TCCCCGTCTC AGAGCACCAA ATAAGGAAAG TGTAATTGGA TGTCATTGCT 
63551 GTCAGTCAGC TGGGCTATAA AAGAGAGAGT GGGGTTGCCT CATCCCCTGG 
63601 GTATCCACAG TCAGCTGTGT CCCTAGAGCT TCTTTTCTTT CATTGCTGCC 
63651 CAGCTGGGTA TATTGCAAGT ATGGATTATA AGAGGGGAAG GGACTTCACT 

637 01 GTTTTAACGT TTGAAACAAA AAGGAAAAAA CTCAGAAGTA GTAAGCTAAA 
63751 AACAACTTGT GCAAACGTTC TGGGATTATT ACTTAATTTT AAGAATTTTT 
63801 GCTAAAAACA ATAGGAACAT CGTTGAAAAA AGAACCCCTT TGAGTGATGA 

638 51 CTGTTTCATA GTGTTCTACA TCTCCTACTC CCTGCCTTAT AAAAATAAAG 
63901 CTAATTAATT GAAGTTCTGG CAAAGAGAAG GGAGTATTCC TTGGCCTTTG 
63951 ACCANNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
64 001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
64 051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
64101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
64151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
64 201 NNNNNNNNNN NNNNNNNNNN NNAATGATTT AAAGAAATCA ATTATTTTTT 
64 2 51 TAAAGTAGAT TCTTATGCTT TGTCCCACTT GTGTCCTTTG AGGAAAAGTG 
64 301 TGAAATACTA CCTGGGGTTC TAACAGGATC TTTGNNNNNN NNNNNNNNNN 
64 351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
64 4 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNA 
64 4 51 CACATGTGTG TAAATAGCTG CAATTCTTTA TGAAAGTGTG TAATAGGGGC 
64 501 TTATAAGCAT GAACTCTTGA GGTACAAGAG GGAAGAATTT GATTCTGCCT 
64 551 GGGGCAGAAT CACTCCACAG AGGCCGTGAT ATTTATGCTG GGCCTTGAAG 
64 601 GAAGAAGAGA TTTTTGGCAG GTTAATTAGC AGGAAAGAGA GCATTGGGTC 
64 651 TGAGACGTGA AAATGAATGG CATGTTTGGA CAGCATTGTG TATTTTGGTG 
64 701 GGGTAGATTG TAAACTCTGG CCCAAGAAGT TTGAGTTTGA TCTTAAAGGA 
64 7 51- GCCAGGGAAA CTGTTTAAGC AAGGGAGGCC ATGGTAGGAA ATGTATATTA 
64801 GAGTGCAACA ATGTGCGGAG TTCACTGGGG GTGGGGAGTA CATGCTGTGA 
64 851 TTAGAAACCA AACCATTCGC TTCCATCATA GGTATTTTCA GTTAAAACAA 
64 901 CCTTGATTTA AAGATCATAA TAATGTATCT TTCGTAGAGA TGAACTTTAG 
64 951 AATTTCATAG AGATGAATAG TTGGCTAGAA ATTGGTTGCT CTATTAAAAT 
65001 GTATTTTCCA GATAAAGACT TTTTAAGTTA TTTTTGTTGA ATTTTAGAAA 
65051 CTTTTGAAAA TCATTTTTAA AAAATGTACC CAGATCAAGG TTGCAGTATA 
65101 AATCACTGCA GAACAAAATT CGTGAATCTT CATGTGATAT ATAATTAAAT 
65151 ATTTTGGATT CATCCAGATG AATAAAATGA CTAGGCCTTT TCCTCCTGCC 
65201 CTTGTTAAAG GTAATGCCTC TTTATCAATA CTTCACACAC ACACACACAC 
65251 AAAAACTCAA AACCTTTCTC TTGCAGTTCC ACGGCAACCT TCAAATGACT 
65301 TGTTTGAAAT ATTTGAAATA GAAAGAGGAG TCAGTGCAGA CGATGAAGCA 

653 51 AAGGATGATC CAGGTGACAC TTACACTCTT AGTGCCATCA TAATGACATT 

654 01 TTATTGTTTC TCTCCATGAA AATAACTTTT AAATGTAGAT CGTTTGGACC 
654 51 ATTTGGGAAA ATTACACCTG TTTTCTTTAA CAACTCAGAT TTTCTTTGTA 
65501 GTTAAATCAG TTGAGCNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
65551 NNNNNNNNNN TAGTACTTAA ACTGTAAAAA GGAAATCATA TTTTATAAGT 
65601 GATGATGTAA CATTGTTATA ATTTGGCAAC ACTTGTGTGT TCTATTTTTA 
65651 CAGATGATGC ACTATTGTGC TGAGAAAAGA TTAATGTAAT TTCCCTTTGG 
657 01 TCTTATGGAA AGAAGCTCTG GTTATATGGA TAGGAGAGAA CACTTGTTGT 
657 51 CTAGATACCC CTAGTTCATA GATTCATTTG TTGTTTTTTG TTGATTTTCT 
65801 GATCCACTCA TCTTTCTGAT TATTTCCTTG TAGGTGTTCT GGTACACAGT 
65851 TGTAATTTTG ACCATGGACT TTGTGGATGG ATCAGGGAGA AAGACAATGA 
65901 CTTGCACTGG GAACCAATCA GGGACCCAGC AGGTAAAACC ATTTCATTTA 
65951 ACTTTTTCTG GTACACATTT CAATGTGATA CTATCTGAAG ACTCCACTGC 
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TGCTAATCAA 
GTCAAAAAGG 
AAATTTGTTA 
TTAAAAGGTC 
GATTAATTCT 
GAATAAAAGA 
CCAGTTGNNN 
TTGAGTGGTA 
TCCTCTGAAA 
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ACACAGAAAA 
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NNNNNTCATG 
CAACGCACAT 
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AAAATAGGAA 
TTGTAAAAGA 
TCTTGCATTA 
TTATCATTAA 
ACNNNNNNNN 
CCTAGCTTAT 
TAAATTTATT 
AGTATTTGAA 
CTCTACCAAT 
TTTCTTCTAT 
AAANNNNNNN 
NNNNNNNNNN 
NNNNCTCTGT 
ATCTGCAGCA 
GTGTGCATGT 
GATTAGACTG 
ATCTTTGGCA 
ATGAATGATT 
ACTCAGGAGA 
TGAGTTTTAG 
AACATTTTTT 
CCCAAATCTC 
TAGAATGGTA 
AGAGCAAATT 
GATGATTGGC 
TTTTACCAAG 
AACTGACCAA 
GTGCTGATAC 
GAAGGTGAAA 
AACCAGTCAC 
TAGAAGACTT 
TGTGATGTTG 



GTCTACTGTA 
ACCATGCCTA 
GCTATCAACT 
AACTAATTTC 
TGCAAATAAT 
AGTAATATAG 
NNNNNNNNNN 
GTAGAAGAAC 
AAATTTCACT 
ACTCATTTTA 
AGGAGTTTTA 
AAGACATCTT 
AAATCNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNGAAG 
AGGAGTTTGG 
TAAGAAGATA 
AAGCCTAGTT 
GTTAACTTGA 
NNNNNNNNNN 
CATTTCTAGC 
TTTGAGTAAC 
NNNNNNNNNN 
NNNNATTTAC 
TATTCAATAA 
CAATATCATT 
CTGTCTTACT 
TGTCACAGAT 
NNNNNNNNNN 
TTAAAATCTC 
TCATTGCTAC 
AACCATCCCC 
CTTATAGGTG 
TGTGAAAAGT 
NNNNNNNNNN 
NNNNNNNNNN 
ACTACTCATG 
TTATTGTGTT 
GGTAGTGTGC 
ACTGACCACT 
GTCATGTATG 
CCTTCAAGTA 
ATTCACTGCA 
AATTTAAATA 
TATTATTCAT 
CCTTTCTCTC 
GACAAAGAAA 
GAAAGGAAGA 
TACGGTCCCT 
AACTTAGATA 
ACTACGGTTG 
AACCAGCATT 
TGTAATGGGG 
ACATCTGAGT 
TAGAGTTTTA 
CTGCCAGAAA 



ACACATCTCT 
G CAT T TAT AT 
AGCACTATAT 
TCTCAAAAAG 
TCTTAAGTTA 
TGGTATATCA 
NNNNNNNNNN 
ACAGTGGAAA 
CTAACTTGGG 
TAACAAAAGG 
AATTGCACAA 
TTCTTCTAGA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TTAAATCTTT 
TTTCATTTTT 
CAGAATGTGA 
CTTGAATACC 
GAGAGCTCAT 
NNNNNNNNNN 
AGGCTCCCTG 
AAGGTATTAA 
NNNNNNNNNN 
TTAAAAATTA 
ATTCCAAATT 
TAGTGCCTCA 
CATTTAAAGT 
TCTGTGAATA 
NNNNNTACAA 
ATCAATTTTA 
AGAGTTGTAG 
AACAGATATT 
AAAAGGAATC 
TGATCATGTT 
NNNNNNNNNN 
NNNNNNNNNN 
CAGGCTCCTA 
TTTGACTGTA 
ATGATCTGAT 
GACCTGGAGT 
ACAGAATGGT 
AGTTACACTG 
CTTCTATGTA 
CTGTATTTTA 
TATATTTGTA 
TGATGGAAAA 
TTTGAAATCG 
GATGCCTGCT 
GGAGGATGCT 
AATAGGACAT 
ACTCAAATCT 
CGAAAAATTA 
TTGAGTCGAA 
GAGGGGGAAG 
GTGTGTAATG 
TATGGACCAT 



GTGTTTACTT 
TTCATTTCTT 
TATGGGAGAC 
CTTGATAAAT 
TTTGTAGCAC 
GCCATGGTAA 
NNNNNNNNNN 
TTTCTATAAA 
TGCTGAAGGA 
TAGATTTAGA 
TTCCTATTAA 
TTCAAGATAG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TATGGATAAG 
TTTCCTTTTT 
AGGAAAAGCT 
ATACTGGATA 
GATATCTTAA 
NNNNNNNNNN 
AAGATCTCTA 
AAAATTAGTN 
NNNNNNNNNN 
AAATGTGAAT 
GGTTCATACA 
CACACAGTAC 
TAACTCTGAG 
AGAGAGAAAC 
TATAAAATAC 
TTTAATTGTA 
CAATTTACAC 
ATAAAAGTAT 
TCTTTTAATT 
TTATCATTCA 
NNNNNNNNNN 
NNNNNNNNNN 
GACAGTCCAA 
CCACAGGCTG 
TAGCAGATCA 
AGTTTGCTAC 
GAATAATATT 
GGGAGAGTAA 
AACAATCTAT 
GAACCAATTT 
AATGTTTAAA 
ATGAGAAGAG 
TTATTGGACC 
TGAGTGATTG 
TTTCTAGAAT 
TGAACTTAAT 
GGAGAGGAAA 
GATTAATGGG 
GTTATGGCTC 
ACCTGACATA 
AAATCAATCT 
CTCTATAGAA 



GATAAAGATG 
TGAAAGAGTT 
AAGTAGTTAA 
AAAATATTTA 
CCATTCCCAG 
TAGGCATTCC 
NNNNNNAGAT 
GTTGTGATTA 
AATGGTTACG 
CATAGTTCCT 
TCTAAAGTTT 
CTTTTCTCCC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GCCTACTTAA 
ATACTCCTGA 
TGTTATTTCA 
AGAGCACCAA 
ACCAGCTCNN 
NNNNNNNNNN 
TGGTGGTCCT 
NNNNNNNNNN 
NNNNNNNNNN 
TAAATTCCTT 
TTTCTTTTCA 
TCTCAGTCAT 
CAGGACAGCA 
CAACATATCT 
CTATGACAGT 
GCAATTAAAA 
TCCTACCAAC 
TTTTTTAAAT 
TTACTTTGCA 
CTCGAGGAGC 
NNNNNNNNNN 
NNNNNNNNNN 
TACTCAGAAC 
GCCTCAGTAG 
CAGACCTGTA 
AGAAGGAGGG 
TATGGTCATA 
TTCATGCTAA 
GTAAAATACT 
AGTCCTTTTC 
TTTGTTGTGA 
GGTTAGAAAT 
GTGTTAGTAC 
GATAACTCGT 
TGGTCCAAGT 
ATTGAAGATA 
GTGACTATGT 
TCACAATTAA 
ATAGTCGAGA 
TCACTAAGGG 
AAGCGAACCT 
ATATGCTATG 
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GCATTTGACT 
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TTCAGAATGG 
TAGCTTGGAA 
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TCATGGTTTT 
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ACATGTGAAG 
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TTATAACCCA 



GAAAATCCAC 
TCTAATATGA 
GGAAAGGTAT 
TAGCCTGGAG 
AGGGTTAGAG 
GGGTAAAAAA 
GAACAGGCTG 
GGGTGACATA 
GTTGGATTAG 
AATGTGGAAA 
AGAAGGATGG 
TGAGGATGGA 
ACCCAGCACT 
TTATTCTAGG 
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GCCAGAGAAG 
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AAACTTAAAT 
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CATATTCATG 
TCAAAACCTT 
CACCAACCAC 
ACTTTCTGGT 



TTTGGTTAAG 
GAAGAATAGA 
CTCCAAGCTT 
AAGAAATGAT 
TTGTTCTCTG 
TACAGGGAGA 
ACATGGAAAG 
TGGCAAGATA 
AAAACCTCTA 
TAGTCCCAAG 
GAAGACCAAA 
ATTGCAGAAT 
GATGGGAAAA 
AAATTCAGGT 
NNNNNNNNNN 
AATATAAAAA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TTACCTATTT 
ATATCCAATT 
AGATTTGTAA 
CTCAAAACCT 
TTCATGGTGC 
TTAACAATGC 
ATGCGCTGCA 
GTGTGTCAAA 
AGAACAAGGA 
CTACAAGGAA 
AAGTATGTCT 
TAAAATAAAT 
CATTCACCAC 
AGATATTTTT 
TTATCGATAC 
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TTTTTCTTTA 
AAAATTATCC 
GTCTATTTTA 
GTTCTTCAGT 
GCAAAGTGTA 
CAACACTGCA 
TGCATTTATG 
CACAAATACC 
TACAGAATTA 
CAAGTTTTGC 



TGGTGAATCC 
GTTGGTCACT 
GTGAAAAACA 
GCATCAGGGT 
TGACCGAAAG 
TTCATTTTGG 
AGTGAGTTAT 
CTGTTAAAAG 
AACCAGCTCT 
AAATTCAGTG 
AGATGGCAGA 
ATATTTTTAG 
TACTAATAAA 
GGATGGANNN 
NNNNNNNNNN 
ATCNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
AATAATTTCT 
CTATCTAGAT 
AT G GAT GAT T 
ATATTCTTAT 
TAAGAAATAG 
TATTAGGTTT 
CTCTAAGAAA 
GGTATGTAGC 
GCTAGAACTA 
TTAAGGGCAA 
TTTGCTTACA 
ATTTCAGCAT 
CAACAATATA 
CCAGCAAGAT 
GGATACAATG 
TACCTTAAGA 
NNNNNNNNNN 
NNNNNTGGGG 
NNNNNNNNNN 
TGAAGCATGT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
GTATTTTTAT 
TTTTTTAATT 
TGGTATTTTT 
AAGAAATTGG 
TTTGCTACCA 
AATATAGTAT 
TGAATGTATT 
TCTAGATTTT 
AAAATAAATA 
AGCATCTATT 



ACTTTTGAGA 
TAGAAGAGGA 
GGCAAAGGAA 
TCCTTTTCAA 
GGGATGAGAA 
CTCACTGTCA 
CTTGGTGGAA 
GTTTTGAGCA 
GAGAGCCTGA 
AGCAGGATTC 
GAGTTGGAAG 
AATATGTTGA 
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TAAAACATTA 
AGTGAAAGCT 
GCAGCCAGTT 
GCATTCTCTT 
GTTAACAGGN 
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TGATTTATGG 
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ATCTTAACAT 
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GTGAATTGAA 
AACTAACATT 
ATTTACCCTC 
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CCTCCCCAAA 
TAATATAATT 
TTTAATAGGT 
TTCCAACTTG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
CTGTGATGAT 
CCAAAATATT 
GAGAAAGATT 
TCAAGGATCT 
CGTTGATTCA 
AGAGGTTTGG 
CTGGGGAGCC 
ACCTGTGACC 
TAAAAAGTTA 
AAAGAGATGA 
CCAAAATAGA 
CCAGTGTTGA 
TGAGGGAATG 
CTTTCTCTCT 
TTTTTTTTCT 
GCCCTAGGGA 
TTTTATTAAT 
TAGATATGAT 
CTCCAAATAA 
CTCTGTATCT 
ACTCTGATTT 
TAAGTTGGGG 
TCTGCTTTAC 
AATTTAGGGT 
GGACAACGTT 
ATGTTGTCGT 
AGTGTTTGGT 
ATGAGCAAAG 
GACAGGCATA 
CTTTACAAAG 
GCTTGTTCTA 
ATTCCAGGTG 
AAAAGCTGCA 
ACCTGTGCCT 
CTCCAGGTGT 
AAGAAATGGT 
CTGACATCAA 
TTTTCCTTTC 
GTGTGAATTC 
ACTCCATTCA 
TTCATTGCAA 
TTTTTTTTTA 
TTTTTTAAAG 
ATTGTTCCTG 
CTACCATAAA 
CTTTCAAGTT 
GTAGGGTGGG 
CAGGATGTGG 
TCAAGACATA 
GAATGCATTT 
TGTTGGGAGA 



ATTCTTGAGA 
GTACATTTAT 
AACTTGCTTG 
TATCATTTCC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
TCAAAAGCTG 
GTTTGGATTA 
CTTAAGACAT 
ATTTCCCTGC 
AACATCCTTA 
GGGAAACTTT 
CTTAGCACAG 
TATAGAGCTC 
ATACCAAATT 
TAGTTTATAT 
TACACATGTT 
AACATACAGT 
TCATTTGTAC 
CTTGAGAAGC 
AATGCAGTTT 
TCTGAGAACA 
ATTTGTGTTC 
ATAGATTTGT 
AATTGAAGCT 
TCCTGGTGTT 
TTTTTTTAAA 
GAGGAGAGAA 
TGTTCAGAGG 
CAGTTTTACC 
ATCATAAACT 
ATGCCTATAT 
TACTAATTAT 
TAAGTGAATG 
CAAATAAGTC 
AAAAGAAAAA 
GAAAGGGATA 
GACAATATCT 
CGCTTGGTGC 
GTCATTCAGG 
TTGTGAGAAA 
GGCCATGGCT 
GAGCGTAAGT 
ATAGGAGAAC 
AGGCTCAGAT 
GTGTCTCTTT 
CCTGTGATCA 
ATGGAAAAAA 
TTGATCAACA 
CTTTCCCTTT 
TTACCCAGGA 
CAAATTTTTT 
AACTCAGAAC 
GAAGTGCTTT 
GGTAGCAAAA 
GGGAAAATGT 
TACCTATGCC 



TATTGTGATA 
CACCAGAATT 
ACACTGAATC 
TTTATTACAA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
ACTTATAGTG 
GTTTGTGGAG 
TATGTATTTT 
AGTTTTTTGC 
AACACCTGCT 
GGTTCAGACT 
AGGCTGGCAC 
CGACGGGACT 
TTATCATGTC 
CTGTTAACTG 
CAACATTCCA 
GTGTTTTCAG 
ATGGATCAAG 
AGCACCCCCC 
CCTTAACAAC 
TCATCAAAGT 
ATGTATATAT 
TTTACTATAA 
CCAGGAAGAC 
AGATTTACAC 
GAACCTTTAC 
TAAGCAGATA 
TTAATTATCC 
AAACCACCAA 
GGTGAGGGTC 
GTCTGATCAT 
TGTGTGTCTC 
TTTTCTGTAA 
ATATTTTTCT 
AAAAAAAAAA 
TTGGCACATA 
GACAGTGTCG 
TACCTCTCGG 
CACAAGGTGA 
ACACGGTGCC 
GGAGGCAAAC 
AGATCCACAA 
TCTGGGATCT 
AAAGGTTTGG 
CTGAAACCAC 
CCACCTCTCC 
AAATCTAGGA 
GAAACTTCCA 
CCTTTTAATT 
GATGGTTCTT 
TCTTTGTTTC 
TACAATTGGA 
GATGGAATAT 
AAATGGTCTA 
CATCATTTTG 
TGTATGTAGG 



CTGTAAACTG 
ATGAAATCAG 
AATTTTTCCC 
AGGGAACGAA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
AGACATGTAA 
GAAGGGGACA 
TAGTGATTCA 
TGTATCCTTG 
GCCTGCTAGC 
TTTAAGTTGT 
ACAGTAAGTG 
TGTTAGAGAA 
CTGTATAAAT 
AAAACTAAAT 
ACCATTTTTC 
TTGACTGAGA 
TTTCAAAATA 
ACTTCACTGC 
AGAGGGAAGA 
GGTCTATCTA 
AATAGGACAT 
AAATGTTTTA 
GGGGAGGGTT 
CAGTGTGAGA 
GTTTAGAGAT 
TAGGGCCCTT 
ACCAGAAAAT 
GCCAGATCGC 
ACAAGGTGGT 
ATTCCATTGG 
TGGCATTACT 
ACACATACCA 
TCTTCAAAAA 
CTAAAACTCC 
CACACAGCCC 
GCAGCCAAAG 
CCGCCTCATG 
CGGGGCTGCA 
CACGGAGCAG 
ACAGATCACC 
AGGAGGCAGG 
GAATTTGAAG 
GTTTTTTTCT 
TTTCTGCTTC 
CTAACCCTGG 
ATAGAAAAGT 
AGGGTGGGGT 
CCTTAGATTT 
TTTTCTTTTT 
AGAGATCACA 
AAGCACTATT 
ATAAATTTGA 
GGGCATCCCA 
CTGGTGCTTA 
TATAGTTTAT 



AATGAATGTT 
AAGTGAAAAT 
TTAGTTAATG 
TTCTTCATGG 
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ATATTTTGGC 
AGGGAAGTGG 
AGTACGTGAA 
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GGGATTACTA 
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AATGGCTTAC 
TCCTGAATAC 
AGCTCTGCCA 
T T T T T T T TT T 
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TAGAACTTAA 
TCTGGGAATT 
TGTCCATGCT 
TGCTTTTCCC 
CCCCAGGGGG 
CATTCAGGGG 
CTCTGGCACA 
CCCTGTGGGG 
TTGCGAGGGG 
ACCTGGGACG 
AAGCCTTGCT 
GGGCCTGATG 
TTCATCCTTC 
TAAAGATCCA 
TTTTTTGGTT 
GGTGAGCTTA 
CAAACTTTTA 
TGAAATGTTT 
GTTAAGCAGC 
CTATTCAGGA 
TCTTAAGTAA 
ACAAAATTAT 
TGAAACTTGC 
AAGAAACCAT 
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TTTAGAAAGA 
GGATTTTGTT 
TTTATTGTTT 
TACGTTTTCT 
ATGTAAGAAT 
TGATTGGATT 
GTTAACCACA 
ATCTTTCAGA 
CTCTTCTAAC 
CCTAAATCGA 
ATTCTGAGTA 
GAAAGAGAAG 
TGCAAAATAA 
CTTAATAAGG 
CATTCTCTCC 
GGGGAGATTG 
AGAACGCTAA 
CTCTTTTTCC 
AGGAGAAGAG 
GTTTTTCTGG 
TGAAATACAG 
TTCCATCCTG 
CATCCTCCAT 
AGTTACGGGA 
ATGTGTTCGG 
TGTACAATGT 
TGGCCTGACC 
GGTGTGTAAC 
GTTAAAGACA 
GTGTTTTGTG 
ATTTACAGAC 
ATACATGGTG 
GGAAATTGGC 
TTAATGGCCT 
ACAAGAAACT 
AAAACCACAG 
CTTTATAAAA 
GAGGAAGCAT 
TAAATTGATA 
GTTGTGCTGT 
TGGGATGTGT 
GGAATTCTAA 
TACCAATATA 
GTTTGTCTTA 
CCAGGAATCA 
AAAATTAAAG 
GCAAACCATT 
GGCGACCAGC 
GCTTAGACTA 
ATACTTCATA 
TAAAAAATTA 
ACCCAAGGAA 
CTACTAGTAG 
GCTTAATTTA 
GGGGAAGAGC 
GCAATCCTAG 
AGTATGATGG 
TGGTTTTCAA 
ATGGATGATG 
CAGTTCAGAG 



AAAGATAGAT 
TAATTCTTAC 
AGAGAAATAA 
CTTGTTTTAT 
AATAAAAGTT 
ATGGTGTTTG 
AAGGAACTAA 
TCTTTGGCTC 
TTTGGTAATC 
ATTGGTCTAT 
ATGTGGTCTG 
CTTCAGTGAT 
GTTTAAAATT 
AGGCATTAGA 
CTAGGTCGTC 
GATTAGATGA 
CAACTCCAGA 
AATTCTCATC 
TGGGTCAGTG 
CCTGCTTTTG 
GGGCATCGCA 
TGTCTCTTTC 
CCTGATTACA 
GCAGTTTTTA 
TGTTGTACCA 
GCTTGTGAAA 
TGAACTCTGA 
ATATCAAGAT 
ATTATGTAGA 
TAAATGTGCT 
TCTAGTTGCA 
AAATGTCATC 
AGAAGTGTTG 
TGAGTTTGAT 
GCAGTTAATT 
CTTTTACCAT 
GTTTTGGTTG 
AGGGGGAAAC 
TAAGTTTAGG 
GTCAGGATGG 
TGGAACGGGA 
GATCCATGAA 
TTAAAAAACA 
ATTTCTGATT 
CAAAGATGAT 
GAACTGGGAT 
GATGGTTTTC 
TGTTCTCCAT 
GAGTATAAGG 
AAAAAAGAAG 
GATAAAAATT 
AAGTAACAAA 
TATGCAGTGA 
GGCATTTCCC 
CCACCAAATG 
CAGTATTAAA 
ACAGATATTT 
TGTTTCTTCA 
TTTCAGATTT 
AGATTTTCAT 



TCCATTAATA 
ATATTTTTTC 
AACAAGATAA 
AAGGAGCAAG 
TTTGGTTTTT 
ATTCTATTCC 
GAAACATTTA 
CTAATATCTG 
CTTGACAAAA 
AGCTTAGAAT 
ATGAACAGTT 
ACTGGCAACT 
GGTTTGGGGG 
AAGGGGCAGA 
TTCAAAGGTG 
TGTGAGCTTG 
ACTAACAATG 
TTCTCTCCTC 
GGTCAGAAGG 
TGCAATCCCA 
GACACATCAA 
AGGAAGGCAT 
AGGTGCTCCT 
AAAGAAATCT 
TGAGTAGTAT 
TTGACTTACC 
CTTTTACTGC 
ACATTTATTT 
GTGGGCACGT 
ATTGATATTA 
AGGTAAAGGG 
CAGTTCCATG 
GGTTGTGGTA 
CTCTGCAAAG 
TAGAACTGTC 
CATAACATGG 
ATTCAGAAAA 
TCCATTGGAA 
CAGTTGTAGT 
GATAGGAAGC 
TTTACACACA 
.CCCCCAACTG 
ATGTAACTTT 
AAGTAATCAG 
TAAAGGGTTG 
TATTGAGCCT 
AAGTATATGA 
ATGCACTAAG 
GAGCATTTCT 
TGTGAAAATC 
TGTCTATTTA 
TTATAGAATT 
AAATCTTTAG 
TCTTGACCTC 
CTGAGCTCAC 
GAAAAAAGGA 
TAGTATCTCA 
TGGTAAAGGT 

CGGGTGCATT 



CAGTTTTTAT 
TTAAAACTTT 
TCATTTTAAA 
GATGCTCTAT 
ACATAGGTAA 
ATTTTCAGCA 
AGATAGGCTG 
TTCCTTTATA 
GTGTGCACTT 
GGCTTTTTAA 
TGATGACTTC 
ATATTCTGTT 
AAGGTTTGCC 
GGAGGCTTGA 
AAAAAAGGCG 
AAAAAAGGCC 
AACTCCTATG 
TTCTCCCTTT 
AAGTCTATTT 
ATGAACAGTG 
AGCCATCTGT 
TCAGCATGCG 
TGTAGCAAAT 
TTGCAGATGG 
TGACTTCCCT 
CTCTTCACTT 
CATTCACTTT 
TTATCTGTTT 
AATCCCTCCT 
AGTATTTACA 
CAGCTTGTGA 
ACCTTATATT 
ACGGAGTGAT 
GATAGGAAAC 
ACTGTTTCAA 
CTCTGGTAAT 
AGGATCCTGT 
CAGATTTTCA 
TCATAACTTA 
AAGTCCCATG 
CTGGAGGAGC 
TATTTCCTCC 
TAAAAGGCAT 
AATATTTTCT 
GAAAAAAAGA 
GGAGAAGAGA 
AGGGTTGGCA 
AATAGAACAA 
TGGCAGGGGC 
TCAGTATCTC 
AGATGGTTAA 
TCCCAAAAGA 
AACTAAATAA 
CTAATGGAGA 
TGAAATATCT 
AACTATTTAT 
GTAATGTCCT 
ATAAGCCTTT 
AAGAGATCCT 
CTCTCTGCTT 



CAGAGTACTT 
TCTCAGTATT 
TCATAGCACT 
AGAAAATATA 
AGCAACAGTG 
AGAAAGCGAT 
TGTGATTATG 
TTCTATCACA 
TATAAACAAT 
AGAATAATTG 
AGTTTCTACT 
TTTTTCCTCC 
TTTATTTTTG 
CTGGTGTGTG 
TGGTCACACT 
ACTGCTCTGA 
TTGCTCTATC 
TATCAGGCCT 
GGTGACCCAG 
ATACCCTCCT 
GGGTGTTGCC 
TGAGCCATAC 
TATGAGAGTG 
CTATGATGTT 
TGAGATATGA 
AAGTTAGTTC 
ATAAAATAAG 
TTTTTTTCCT 
TAGTAGTATT 
TGTTCCAAAT 
TCTCAAAAAA 
GGCAGCAGTA 
GAATTTTTTT 
CTTTAGGAAG 
GTTACACTTT 
ATGTAGGAAG 
TGCAGAGTGA 
CACAACGTTT 
TGTTGCTCAT 
CTTAGAGGCA 
AGGGCAAGTT 
CTGCATATTT 
CATTCCTGAG 
GCTGTTTTTG 
TCTATGATGG 
AGACTGAGGG 
CAGAGAGGGT 
GAGGAAACTG 
CATTGTTAGA 
TCTCTCTTTC 
AGATGTTCTT 
TGTTTTGATC 
TTTGGACAAG 
GGGATTGAAA 
CTCCCTTATG 
TCCAAATGAG 
AGTGTGGCGG 
CATTTGTTCA 
TCAAGGAACA 
CGTGTGTGAC 



FIGURE 3Z 



li 


3001 


AAGTTATCTT 


li 


3051 


TTTCCTTCAC 


li 


3101 


CCCAGGTTCC 


li 


3151 


GGTCTTCTGT 


li 


3201 


ATCATTAAGC 


li 


3251 


TAATGGTTCA 


li 


3301 


GGGGAAAATA 


li 


3351 


TAAACTGATG 


li 


3401 


ATTTAATGTT 


li 


34 51 


ATAAAAACAA 


li 


3501 


TATTCTTGTT 


li 


3551 


AGGTAGTTAA 


li 


3601 


TTTGGATGGT 


li 


3651 


TATAAAACAT 


li 


3701 


GATCAGAAGA 


li 


3751 


TATGTTTTAT 



GGCTGCTGAG 
CTCATCAGTA 
ACAGAACAGT 
CATTTAACCT 
CTTTGAGTAA 
TTTCCTTTAT 
AATGAAAATT 
GAAGAAGTTA 
TTCTAAAATA 
TTATTTGTAA 
TTTTGAGTTT 
ATGATGTTTC 
GGGAACAAAG 
GAGACACTTT 
TCACTATAAC 
TATCATTTCA 



AAAGAGTGCC 
TGATTCAGTT 
AATATTTTTT 
GGTAAAGGCA 
CGGCAGAATA 
GGTCATATAA 
TTACTTTTCG 
TCCAAAGTAC 
AAAAATGTTA 
ATAAAAACAC 
TGTTTTTTTT 
ACAAAAGTCA 
ATGTTGCCTG 
ATTTTTTTTA 
CCATGCCATG 
TTGTAAACAT 



CTGCCCCACA 
TCTCTTATCA 
GAACAATAGG 
GGGCTGGAGG 
TATGGCTGTA 
CTGCACAGCT 
ATGCCAATGA 
TGTATAACAT 
GTGGTTTTCC 
TGTTAGTAAT 
TGACTTGGAA 
TAGTAGAATC 
CAGTATTATA 
TCAGCATGAA 
CCTTAGTAAA 



CCGGCAGACC 
ATTGGACTCT 
TACAATAGAA 
GGGAAAATAA 
GATCCATTTT 
GAAGATGAAA 
TACATTGCAC 
CTTGTTTATT 
AAATGGCCTA 
ACCAGTTGTC 
AAAAGCATTG 
CCTTTTACTG 
CTTTCTAGGT 
CAGGGAAAGA 
TTGCTTTAGT 



TTGCT (SEQ ID NO: 3) 



FEATURES : 

Genewise 



results : 



Start : 


3000 




Exon : 


3000- 


3070 


Exon : 


5248- 


5348 


Exon : 


19533 


-19583 


Exon : 


34426 


-34518 


Exon : 


39701 


-39790 


Exon : 


44102 


-44221 


Exon : 


45394 


-45513 


Exon : 


47168 


-47302 


Exon : 


47607 


-47729 


Exon : 


49126 


-49127 


Exon : 


49132 


-49522 


Exon : 


65277 


-65363 


Exon : 


65834 


-65932 


Exon : 


74008 


-74264 


Exon : 


75715 


-75807 


Stop: 


75808 





CHROMOSOME MAP POSITION: 

chromosome 4 



ALLELIC 


VARIANTS 


(SNPs) : 




DNA 








Position 


Ma j or 


Minor 


Domain 


3365 


C 


T 


Intron 


3411 


G 


A 


Int ron 


5243 


G 


A 


Intron 


6532 


T 


G 


Intron 


9208 




G 


Intron 


15833 


T 


C 


Intron 


27795 


G 


T 


Intron 


31341 


A 


G 


Intron 


33370 


G 


A 


Intron 


37788 


G 


A 


Intron 


41465 


C 


T 


Intron 


41466 


A 


G 


Int ron 


41653 


T 


G 


Int ron 


47666 


G 


A 


Exon, ( 


52613 


G 


A 


Intron 



Protein 

Position Major 



Minor 



281 



FIGURE 3AA 



52645 


G 


A 


Int ron 


59197 




A 


Int ron 


63508 


T 


C 


Int ron 


75153 


C 


G 


Int ron 


76017 


C 


G 


Int ron 


76033 


c 


T 


Intron 


77194 


A 


G 


Int ron 


77508 


T 


A 


Int ron 


77557 


G 


A 


Int ron 



Context : 



DNA 

Pos i t ion 

33 65 CGGGAGGTGAGCTGGGCCCCGGGGCGCCCTCTCCTCCTTCCCGCGCTAATTTCACACTCA 
CTGTCTTGGGTCACTTTTCCCCGCGGGGTTTCGTGGTCAGAGAGGCGTCTCCTCCATCCA 
GAAGTTGGGCCACCGCACAGCGTGGCGCGAGGAGAGCGGTCCAGCGGCTCCGAGTGCCCG 
CCCGAGGCGGAGAGGGCGCGCCCTTGCGAGTCTGGGACCCCATCCGCGGCCCCCCGAGGG 
CGACTCGCCCCGGCTCGGGAATTAGGACTGAGGGAGAGGAGCCGCTGGAGCCTGGGATCT 
[C,T] 

GGCTCTGAGGGCGCGGTTTAGCCACCTACGCCGAGGTGACGCGCGAAACATCCCTTACCC 
GGGAAACTCCCGCGCCTGAACTAGACGGCTCTTCACTGGGGAAGCTTCCAGGCCCCCGGG 
GGGAGGCCCGGGCTCTGCTCAGGGCTCTCGGGGCCGCTCACACAGAGAGTGGGTGCGAGT 
CAGCGACTGGGCTACGGGGGAGATTTGTGGGCCTCTCCATTTGGTTTTCTTGAGGGAAGG 
AGACTCAAAATGAGGACCGGAGGGTGGGCGCTCCGTGAATGTGAGCATGAGTGTGTGGAT 



3411 T AATTTCACACTCACTGTCTTGGGTCACTTTTCCCCGCGGGGTTTCGTGGTCAGAGAGGC 

GTCTCCTCCATCCAGAAGTTGGGCCACCGCACAGCGTGGCGCGAGGAGAGCGGTCCAGCG 
GCTCCGAGTGCCCGCCCGAGGCGGAGAGGGCGCGCCCTTGCGAGTCTGGGACCCCATCCG 
CGGCCCCCCGAGGGCGACTCGCCCCGGCTCGGGAATTAGGACTGAGGGAGAGGAGCCGCT 
GGAGCCTGGGATCTCGGCTCTGAGGGCGCGGTTTAGCCACCTACGCCGAGGTGACGCGCG 
EG, A] 

AACATCCCTTACCCGGGAAACTCCCGCGCCTGAACTAGACGGCTCTTCACTGGGGAAGCT 
TCCAGGCCCCCGGGGGGAGGCCCGGGCTCTGCTCAGGGCTCTCGGGGCCGCTCACACAGA 
GAGTGGGTGCGAGTCAGCGACTGGGCTACGGGGGAGATTTGTGGGCCTCTCCATTTGGTT 
TTCTTGAGGGAAGGAGACTCAAAATGAGGACCGGAGGGTGGGCGCTCCGTGAATGTGAGC 
ATGAGTGTGTGGATGTGTGTGTGAGAGCGCACACACTGCGCCGCTCCTCAGACTCGGGCG 



524 3 GGCAGCCTTGAGAATTGTTGGTGTTAGCTGGTGCTATTTCTACTTAAGAAAGCATCTTTC 
CCTCCCGTTTTTTCTCCTACCCTTGTGTGTTGGGGGATGGGGTATTTAACTGTCTGAAAT 
TTGACATTAGATCATAGCAGAACTATTTCTGTGAAGGTGTTTTAACCTTAAATTTCTGCA 
GGCATAAAAGAGTTTGTAGAAAATTGTTGGGTGTGGTTGACATTTTTTCTGAAGTAATAA 
TACAGAGG7VAATTACCTTTCTTCTCAAAAGTGTCCTTATTTATTTTGAATCTTTTTTTTT 
[ G, A] 

GTAGGTGGCCCAGGCAAATAGTGTCATCGATTGGCCTATGTCGTTATGGTGGGAGGATTG 
ACTGCTGCTGGGGCTGGGCTCGCCAGTCTTGGGGACAGTGTCAGCGTGAGTATCAAGCCT 
GGGGACTTCAGTTCCCTGGGAGGTGTGGCTTTCCACCTTGTTCATGGCTTCACCCCACAT 
ATCAGAGGGTTCATTACTGAGCAAGGCTTGGCCTTGCAGGTCTGACTTGGGGATTTTCAG 
GTACAGTCCAGACTCCTTATTCTGCTTCTTTTCAGCTTTAGCCACCTGTATTACGGCCCA 



6532 GGGCGGGGGTAGGGGGCACTTCTGGTAAGATGATGGGAACTAAGTTGGGTTCTACATTGG 
GATATATATTTTATTGCTAATGAGGAGGAGGCTTAGAGGAAGAGAGAAGGGCAGTTACGA 
AGGCTAGAGCTGGCAATGGAGAAGCCTGCCTTAGAGATGGGTTGCTAGTGTGAGGAGTCA 
GGCAAATTTAAGTTCAGGAAAGTTAGGAGTTCCCTCTGCTATTTTAATTTTTGAGGATGC 
TTGCAATGTCTTCCTTAATTTTGTGAAAGAGGGACAGTGACAGTCACAGATTGACTCTAA 
[T,G] 

TGCACATAAAGACCACAATCTCTGGTTGGGAATAGAAAGGTAAAGGAAATGAATGTTTGC 
CTACCTGGTATGGAATTTGAGAACCAACAGATTCTAATAACCAAAATGTGAAGAAAGGAC 
CCTTCTGTTGGCCCAACACACCTACACATAACCCTCCTGAGTGAAAAATGAGTAGTTCTA 
TACCTGCAGTCTCCAGCTGTGCAAATACTTCTGATACTACAGAAGACTAAATTCCACCAG 



FIGURE 3BB 



GCACCATTCTTCTTTTTTGATCATCTTCCCTTAAAACAATATTGAATAGACTAACCAGTG 



9208 



15833 



pj 27795 

6 

4 



31341 

li 

si 



33370 



37788 



TCTGAGTTCTTTGGCTGTCATTGAGCAACTTGTGGTTTCTGAAGGAAATTATGTGAATTA 
GGATGGTTTTGTATCATTTATCCTTAAGAACAGGGAAAATTGAGATGTTTTCTTATGTTT 
CTGCTGGAGATTTTGGAAAGATGTGAAACCTACACCTACAGATTGACCTTGCTTAGTTAG 
CTCTGAACCTCCTGCTGCCTCTTCCACGTAAAGTGAAAATTTTGGATTCTTATCGGCTTC 
AGATAAACTTACAGGTTAGTGAAACATAGGGACTGAGATATAGTAATTCATTCTGAAGCT 
[-,G] 

TTTTGGAGTGGTCAAATAATTTTAGTTGGATAATATATATTACTGGCTAATGATTGTGGA 
TATTGGAAGTGATGAAAAAATTATTGAATTATTTCTTTCTGCATTTCAAATGAAAAGGCT 
ATTAGTTTGAGCAGAGAATTTTGATTTAGTAAACAAAATATTTAAATTTCATGTTTCATT 
TCTTTCTCCTATCTGGGTTCAGATACTCAGTCTTATAAATGGAACATGATTTATTTTTGC 
TCCCTAAACTGGTTATTAACTTCCTGTCCATAATCACAAAACTATATAGATTATATATTT 

CATACATTAGCTTTGGTAGATACTAGATGCCAGGGGAGTGCAAATTTGAGAATAAGGATT 
TGGCATGGGTTATTTGCATGTTGAGAGTCAAGATTTAACTAAATTTTAAAACCAAACACT 
TCATTTGATCAATATCTTTTTAACCATTCTGTAGATTAATAATATAAATTCTCCAGAGCT 
GACATTACTTTGCTATAACATCATCAGATCACAAGATTAGGGTTGCCTTTTGTAGATGTT 
ATTCACTTACTCTGAAATGTTAGGAAATATGTCATCACAGTTTAAATTTGTAGTAATATA 
[T,C] 

ACAAAAGGAAAACACAACTAGGAATTTTGGATTTATGCTTACTTTGCCAAAAACCATGTT 
GATTTTCAAAAACCTTTAGCC 

ATGAAAACAGAAAGAGAAAGTTAAAATTTCAAGTAAACTGAAAACAACAATAGTGCTCAG 
AGGATTTTTTTTTAAGTGAAAAGGGATAGTGCTTAATTATGACAAATAAAAGCTAACTTG 
AGATGCACATATACAGATGCAGCCACTTATTTTGGCGGGGGACACTTCAGGAATTAAAAT 
TTAAATAGCGAGAGATCAAATAGATACTTGGTAAATGTGTCTGAATTGGATGTTCCCAGA 
CACAAAAATAAAATGAGTTATTGACAGCTCTTGGGAGACAACATTATAAAGACTAGACAC 
[G, T] 

TTATTTATTTTAACTCTATGTTCTAAATTACCATTGAGTAATTGACATTCGTATTTGACT 
ATGGTTTGTGGTTAAGTTCTTAATTGCAATAATGTTAAATAAAATGTGAAGCCCAAAGCA 
AACAACAACAAAAATTATAGCAATACTTCAACAGAGGTAATAATAATATGCTGCATCAAT 
GGTTCAGAATCCAGCATCTACATAAAACAAGCAACAGGGTAATGAAATTATTTTCTTTTC 
AAATATTCTGGCAGAGCTACTTTAGTTTTCTTAAGTTATAGATTGTGGTCTTAACTGCAA 

CCACTTTGCTTATATTCTGAAAGTCTTTTGTTTCCTATTCCACTGCTCTTACCTCTAACA 
CACTGCCTCTAACACACCAACCTGCAGTTGTAGTTATTACACACCCTCCTTGGTTCTTTC 
ATCTCTCTATCACAGCCCTTGTTGTGGTTTAGCCAATATATTTTAGTTCCACAGCTAAAT 
TTTCATACCCTCTATGACTCTCTAATCCCCTGCCACACTTGCCTACTATAATACATTATA 
TATATAACAAATGTTTGATACGTATTTATTGAATTCCATTCCAGAACTAATGCCAGCAAG 
[A,G] 

TAACTTTGTGCTATATAGGAGAATATCTTTTTGTGCAACAGTTTCCAAAGGGTTTTCTTT 
TTCTAAGAAGAAAGAAATTGATTGTATCAACTTTATGAGTATCCTACCGCATTTAATAGC 
CATTGGCTAATCTAAGGGTTCCTGGTTACTTCACTGAATAGCCTATCAGATGGAAGTGCA 
AACAACAGTTTGTTTTGAAATAGGACTCCCTAAACATGGAAGAAACATTAACAGTGTTGG 
CCTGTTGGAATGTGTGCATTTGATGTGCTCAAGATTAGGGCACTCTGCTTGAGAACAAAT 

CCACTCACCTGATTGTTTCTCACTCTTCCTCCACCATTTACCCATCTCTTTAGCTTAATG 
AGTCCCCTGTGTATCTCCCAACTAAACAGCGGCTTACTTGCCTGTGAAATATTCTTCTCT 
TGGGTAGTCTGCTCCCTTCTCTGTCTACTCATGCTTCAAGATTCAACATAAGCCTCCTCT 
ATGAGGCTTTCTGCACGTATGTATATGGATTTGCTTGTGTAATGATTTCTTCACAGATTT 
CATATTGCTGATAAATAAATATTGTTTTGAATAAGAAACGTGGTTTTGTATTTTTATCTC 
[G,A] 

ATTGTAGACTCCTTGAGACCAGTACCATGCTATACAATTATTTTTCATCTATTATAGTGT 
CTGGCATAGGGACATGCACATATTTGGTACAGAA 

GCCCTGTCAGGATCAGGGAAGCTGCCATGTGTGGCCTGGTTGAGTGTGAGAGCTGCCTAG 
AGACTTCCATAAAAGTTGTTAGAGAAAATTGTTGATAGTGCCTACATAGCAGATTAACTT 
AAACTGTTTTCATTCAGGCCCACACATTTTAATAAAGTAGAAAATATGCTTCACAGATAA 
GGGAAATCAAACAGGCTCCTTTTTTCTGGAGGAGAGAAATGTCAAAAAGAATTAAATTTG 




AAATAACTTTACAGAACTGGAAATTAGCTTTTGATTAAAAGTAGCTTTTGGTATATGACA 
[G, A ] 

GTATTCACTGAGAATTTTGTAGCGAGTTATATACTTTAAGAAATAACCCCCAGAAACTTG 
CATCATGGTGTAAACAGCTTGAATAAACAAGTGCTTAACCAGTGCCTTTAGAGCTGCCTG 
GGAAACAGCCAGAATACCAGGGCAAGCTGCATTTTGGAACTGGTTTAATTTAGTAGCCTT 
GCCACAGGCTTAGTGTGATCTGCTTTTGGTGGCTTGATCTTCCCCACTAAGTCATTTTCT 
GGATTTGTTACACCTAGAACTGTTAGGAAATTACAGGCTTGGGCTGATCATTAACATACT 

414 65 TTTATTTCTAGGTTCTTTTATCAACTTTTAAGATTTATTTGAGACAGTATGATCAATGAC 
TTCATTTTGTTCTGCTTATTATTGTAGGAGTATTTACTATAATTTGGAAGTAATTTATTT 
TTGAATTTATTGCTTAATTGAATGATCTCCAATAGATTGTGATAATGAACACAGCATTTA 
TAGAAAGCAGCACATATTAACTTACTTAATATGGCACTAGGTCAATGAGAAAAGAAGGTA 
ACATAATTGAAGACAAGAAACTCTTAAGAAAACTGAGGACAAAAAGGCTTCTCACCAGGA 
[C,T] 

ACCAGATGCATTTAATCTTTTGAAGCTCTGTACTTTAGGAAAAGTCTGATATTTGGCAAA 
TTTTGATAAACATGGATGACTATGGAATCCTATTTTATAGTATCTGAAGTGGCTTTCATA 
AGGGTCATTGTGAAGTTTTTAGGAGACACCTGCCTGTGGCAGATGGGACAATGATGGCAG 
TCACTAGTGATATTAACACCAGTCAGCTGTCAGGGAATATCATCCAGACCATCAGCAGCT 
GGTAGAGTACAGCTTTCTCAATTGCTTTCCATGTTTTGGATACTTATATGCCCGTTAATA 



414 66 TT ATTTCTAGGTTCTTTT ATCAACTTTT AAGATTT ATTTGAGACAGTATGATCAATGACT 

TCATTTTGTTCTGCTTATTATTGTAGGAGTATTTACTATAATTTGGAAGTAATTTATTTT 
r| . TGAATTTATTGCTTAATTGAATGATCTCCAATAGATTGTGATAATGAACACAGCATTTAT 
j : L AGAAAGCAGCACATATTAACTTACTTAATATGGCACTAGGTCAATGAGAAAAGAAGGTAA 
CATAATTGAAGACAAGAAACTCTTAAGAAAACTGAGGACAAAAAGGCTTCTCACCAGGAC 
^ [ A, G] 

\J CCAGATGCATTTAATCTTTTGAAGCTCTGTACTTTAGGAAAAGTCTGATATTTGGCAAAT 
\Ji TTTGATAAACATGGATGACTATGGAATCCTATTTTATAGTATCTGAAGTGGCTTTCATAA 
GGGTCATTGTGAAGTTTTTAGGAGACACCTGCCTGTGGCAGATGGGACAATGATGGCAGT 
CACTAGTGATATTAACACCAGTCAGCTGTCAGGGAATATCATCCAGACCATCAGCAGCTG 
j «n GTAGAGTACAGCTTTCTCAATTGCTTTCCATGTTTTGGATACTTATATGCCCGTTAATAA 

n 4165 3 AGC ACATATT AACTTACTTAATATGGCACTAGGTCAATGAGAAAAGAAGGT AACAT AATT 

j (ab GAAGACAAGAAACTCTTAAGAAAACTGAGGACAAAAAGGCTTCTCACCAGGACACCAGAT 
; " GCATTTAATCTTTTGAAGCTCTGTACTTTAGGAAAAGTCTGATATTTGGCAAATTTTGAT 
|«) AAACATGGATGACTATGGAATCCTATTTTATAGTATCTGAAGTGGCTTTCATAAGGGTCA 
H TTGTGAAGTTTTTAGGAGACACCTGCCTGTGGCAGATGGGACAATGATGGCAGTCACTAG 
%j [T,G] 

r*\ GATATTAACACCAGTCAGCTGTCAGGGAATATCATCCAGACCATCAGCAGCTGGTAGAGT 
^ ACAGCTTTCTCAATTGCTTTCCATGTTTTGGATACTTATATGCCCGTTAATAACAGGTAA 
H AATAGCCAGTACATCATTTCCACATTTACCCATTGAATGTTGCATGTTTTCTTCCTTTCA 
CATATTCATACAGTCCAGATTTTTTTTTGGACTCATGACAGCACATTGGCTTTTCTTTCC 
TTTCAGTTTCATGATTCTTAACCCCAAAGTGCTTTTGCCATGGGAACGGAAGGATAAATT 



4 7 666 GTGGTGATGGCTGGAATGTCAGGGGCAGGGGAGAGTACTGGCGTTAAGTTAAACCAACAG 
ACATCCAGTTTAACCACTGGTAGTTCTCAGTCTACATGTAGTTTATTTCTTCTGTTTATC 
TGCCAATTTTATGTAGATCATCACATTGCCAAAAAAAATCATTTTTGAAACTGTATATAT 
TTTTTATGTCATCATATTTATCTCCTAAATAAGTCTCTTCTTTTCCTACTTTCTGATGCA 
GACATAGACGAATGCTCACTTGGTCAGTATCAGTGCAGCAGCTTTGCTCGATGTTATAAC 
EG, A] 

TACGTGGGTCCTACAAGTGCAAATGTAAAGAAGGATACCAGGGTGATGGACTGACTTGTG 
TGTGTGAGTAGCACTTGTCTCTCAGCTTTAAATTCTAGCAGGAAATACAGGATTACACAA 
AGGCCATTGCTAGGGAAAATAAGGAATAAGATTATCAAAGAAGTATAATTGTCATAATTG 
GTTATATTTGTCTTTGATTTCCACAAACAATAAAATCACTTGCTCAGGTACTTGTAAAAA 
CTAAGGACTCAGTAATACACTATAATCTTAAGAGTATTTTAATCTCTTCACTGAAATCTC 



52613 AGGTAATGGGACCTCTATCACCTGTTGCTAAGGTCTAATAATGTGGATTCATCCTTAATT 
ACTCTTCATACATACAGTCAATATGTAAGGAAATCCTGTTAGCTGTACCTTCAAATTATA 
TTTAAGTGTGACCTTTTCTCACCAACTCCAGTGCTACCACCCTGGCCCAAGCCATCTCTC 
CCCTGGAATACAGGCCCAAAATCCTTTATCCAAATAACTTATGGAATATAGCATTTTTTA 



FIGURE 3DD 



GATTTTAGAAAATCAGTAAGGTACACATACTATATAGTACACACTGAAGTAGTGAAACAT 
EG, A] 

CTGATTTTCCTCTAGTGCTTTTACTGTGAACGTATCAATGTTAAGGAAAGGAAAATGATA 
TTAACTCAGAGATGATGTCTCACAGCATATATTTACTAGCTTGCACAAATTTTTTAATGT 
TAGCAAGATTTAAGGCAAATTTTTTACTTATATTTTAATTGGATCCTATGATGATTATTA 
AAGAAAAAAGTAGTTATCTCTAGAAAGTATAAATTGAGCTCTTGGCAAATGTGAGCAAAA 
CCAGAAATCAGATTTTTTTAAAGTTACGTGTACATTTGTCTATAGAGTTATAATTAAAAG 

52 64 5 GTCTAATAATGTGGATTCATCCTTAATTACTCTTCATACATACAGTCAATATGTAAGGAA 
ATCCTGTTAGCTGTACCTTCAAATTATATTTAAGTGTGACCTTTTCTCACC7VACTCCAGT 
GCTACCACCCTGGCCCAAGCCATCTCTCCCCTGGAATACAGGCCCAAAATCCTTTATCCA 
AATAACTTATGGAATATAGCATTTTTTAGATTTTAGAAAATCAGTAAGGTACACATACTA 
TATAGTACACACTGAAGTAGTGAAACATGCTGATTTTCCTCTAGTGCTTTTACTGTGAAC 
[G,A] 

TATCAATGTTAAGGAAAGGAAAATGATATTAACTCAGAGATGATGTCTCACAGCATATAT 
TTACTAGCTTGCACAAATTTTTTAATGTTAGCAAGATTTAAGGCAAATTTTTTACTTATA 
TTTTAATTGGATCGTATGATGATTATTAAAGAAAAAAGTAGTTATCTCTAGAAAGTATAA 
ATTGAGCTCTTGGCAAATGTGAGCAAAACCAGAAATCAGATTTTTTT7VAAGTTACGTGTA 
CATTTGTCTATAGAGTTATAATTAAAAGTATTGTGCTCATAGCCAGTCTGTGATCTGGGG 

59197 CTACTATTGATGTTGAATTCATTTATATAGT7VATGATAACATTTCCTACTTAATTCATAA 
AAAGACAGCCTATGCTGTTTTCTTGTTCTGAGTTTATATGTTTCTCATGCTTTTTATTAT 
GGTTCATTACAATTTTAATGTTATTTTTAACTAACTAGATCCTTTTGAAACAAATTGGTT 
TGCAAGTGTGAGCTGTTAGGTGCACAGAGAAAAATGAAAATAGAAACTTGCGATTTTATT 
CTAGGCTTGTTACCAAATATTTAGAATACTGTGTTTTATTTAGGTGTTTATAGTCTCATT 
[-,A] 

GACAGTTGTGATTTTAAAATAGAGACCACATCATCTCAACTTCTTTACTGTGAAAATAAT 
GACAATAGTCTTTTCAGAGATGAATCTGTCTAGATGGGAAATTTACATGATTGATCTGAT 
GAGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
AAAATAATCTGACAAGTAGTTTCCCCAG7W\ATCTGATTTAGTAAATGTACCAAAAGGAT 
TTAGAAATCTACATCATAAATAAACATTCTATGTTATTTTAGTTCAGACCCTATTTTAAT 

63508 ACAGGCTCCCGGTGCCAGGAAACCTTTACATCAACACTCGATTTGCCATTTGATAGTCCT 
TCATCTGGGAGGAAAAAAAAAGACGGAGGGGAGCTTGAAAAACTGTCATAATGTCCCTGG 
AATATGGTACTTTTAAGAGTTGAGCCTATTCCATTTTGGAGATGATTTATATAAGTTACA 
ACAAAAGAAGGGGACAAAAACATGATTGTTCTATGGAGTTTTTATAACTTTCTGTCACAA 
GAAAGCACGCTTGTCTACAATTTTGTAATATTTCTAGTAAATAAAAGAGGCACTCCCCGT 
[T,Cj 

TCAGAGCACCAAATAAGGAAAGTGTAATTGGATGTCATTGCTGTCAGTCAGCTGGGCTAT 
AAAAGAGAGAGTGGGGTTGCCTCATCCCCTGGGTATCCACAGTCAGCTGTGTCCCTAGAG 
CTTCTTTTCTTTCATTGCTGCCCAGCTGGGTATATTGCAAGTATGGATTATAAGAGGGGA 
AGGGACTTCACTGTTTTAACGTTTGAAACAAAAAGGAAAAAACTCAGAAGTAGTAAGCTA 
AAAACAACTTGTGCAAACGTTCTGGGATTATTACTTAATTTTAAGAATTTTTGCTAAAAA 

7 5153 AAGACATAGGTAGCAAAAAAATGGTCTAGGGCATCCCAACAAAATTATGAATGCATTTGG 
GAAAATGTCATCATTTTGCTGGTGCTTATGAAACTTGCTGTTGGGAGATACCTATGCCTG 
TATGTAGGTATAGTTTATAAGAAACCATTTTAGAAAGAAAAGATAGATTCCATTAATACA 
GTTTTTATCAGAGTACTTGGATTTTGTTTAATTCTTACATATTTTTTCTTAAAACTTTTC 
TCAGTATTTTTATTGTTTAGAGAAATAAAACAAGATAATCATTTTAAATCATAGCACTTA 
[C,G] 

GTTTTCTCTTGTTTTATAAGGAGCAAGGATGCTCTATAGAAAATATAATGTAAGAATAAT 
AAAAGTTTTTGGTTTTTACATAGGTAAAGCAACAGTGTGATTGGATTATGGTGTTTGATT 
CTATTCCATTTTCAGCAAGAAAGCGATGTTAACCACAAAGGAACTAAGAAACATTTAAGA 
TAGGCTGTGTGATTATGATCTTTCAGATCTTTGGCTCCTAATATCTGTTCCTTTATATTC 
TATCACACTCTTCTAACTTTGGTAATCCTTGACAAAAGTGTGCACTTTATAAACAATCCT 

7 6017 CGTCTTCAAAGGTGAAAAAAGGCGTGGTCACACTGGGGAGATTGGATTAGATGATGTGAG 
CTTGAAAAAAGGCCACTGCTCTGAAGAACGCTAACAACTCCAGAACTAACAATGAACTCC 
TATGTTGCTCTATCCTCTTTTTCCAATTCTCATCTTCTCTCCTCTTCTCCCTTTTATCAG 
GCCTAGGAGAAGAGTGGGTCAGTGGGTCAGAAGGAAGTCTATTTGGTGACCCAGGTTTTT 
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CTGGCCTGCTTTTGTGCAATCCCAATGAACAGTGATACCCTCCTTGAAATACAGGGGCAT 
[C,G] 

GCAGACACATCAAAGCCATCTGTGGGTGTTGCCTTCCATCCTGTGTCTCTTTCAGGAAGG 
CATTCAGCATGCGTGAGCCATACCATCCTCCATCCTGATTACAAGGTGCTCCTTGTAGCA 
AATTATGAGAGTGAGTTACGGGAGCAGTTTTTAAAAGAAATCTTTGCAGATGGCTATGAT 
GTTATGTGTTCGGTGTTGTACCATGAGTAGTATTGACTTCCCTTGAGATATGATGTACAA 
TGTGCTTGTGAAATTGACTTACCCTCTTCACTTAAGTTAGTTCTGGCCTGACCTGAACTC 

7 6033 AAAAGGCGTGGTCACACTGGGGAGATTGGATTAGATGATGTGAGCTTGAAAAAAGGCCAC 
TGCTCTGAAGAACGCTAACAACTCCAGAACTAACAATGAACTCCTATGTTGCTCTATCCT 
CTTTTTCCAATTCTCATCTTCTCTCCTCTTCTCCCTTTTATCAGGCCTAGGAGAAGAGTG 
GGTCAGTGGGTCAGAAGGAAGTCTATTTGGTGACCCAGGTTTTTCTGGCCTGCTTTTGTG 
CAATCCCAATGAACAGTGATACCCTCCTTGAAATACAGGGGCATCGCAGACACATCAAAG 
[C,T] 

CATCTGTGGGTGTTGCCTTCCATCCTGTGTCTCTTTCAGGAAGGCATTCAGCATGCGTGA 
GCCATACCATCCTCCATCCTGATTACAAGGTGCTCCTTGTAGCAAATTATGAGAGTGAGT 
TACGGGAGCAGTTTTTAAAAGAAATCTTTGCAGATGGCTATGATGTTATGTGTTCGGTGT 
TGTACCATGAGTAGTATTGACTTCCCTTGAGATATGATGTACAATGTGCTTGTGAAATTG 
ACTTACCCTCTTCACTTAAGTTAGTTCTGGCCTGACCTGAACTCTGACTTTTACTGCCAT 

7 7194 AACGTTTT AAATTGATAT AAGTTTAGGCAGTTGTAGTTCATAACTT ATGTTGCTCATGTT 

GTGCTGTGTCAGGATGGGATAGGAAGCAAGTCCCATGCTTAGAGGCATGGGATGTGTTGG 
AACGGGATTTACACACACTGGAGGAGCAGGGCAAGTTGGAATTCTAAGATCCATGAACCC 
CCAACTGTATTTCCTCCCTGCATATTTTACCAATATATTAAAAAACAATGTAACTTTTAA 
AAGGCATCATTCCTGAGGTTTGTCTTAATTTCTGATTAAGTAATCAGAATATTTTCTGCT 
[A,G] 

TTTTTGCCAGGAATCACAAAGATGATTAAAGGGTTGGAAAAAAAGATCTATGATGGAAAA 
TTAAAGGAACTGGGATTATTGAGCCTGGAGAAGAGAAGACTGAGGGGCAAACCATTGATG 
GTTTTCAAGTATATGAAGGGTTGGCACAGAGAGGGTGGCGACCAGCTGTTCTCCATATGC 
ACTAAGAATAGAACAAGAGGAAACTGGCTTAGACTAGAGTATAAGGGAGCATTTCTTGGC 
AGGGGCCATTGTTAGAATACTTCATAAAAAAAGAAGTGTGAAAATCTCAGTATCTCTCTC 

7 7 508 TCACAAAGATGATTAAAGGGTTGGAAAAAAAGATCTATGATGGAAAATTAAAGGAACTGG 
GATTATTGAGCCTGGAGAAGAGAAGACTGAGGGGCAAACCATTGATGGTTTTCAAGTATA 
TGAAGGGTTGGCACAGAGAGGGTGGCGACCAGCTGTTCTCCATATGCACTAAGAATAGAA 
CAAGAGGAAACTGGCTTAGACTAGAGTATAAGGGAGCATTTCTTGGCAGGGGCCATTGTT 
AGAATACTTCATAAAAAAAGAAGTGTGAAAATCTCAGTATCTCTCTCTCTTTCTAAAAAA 
[T,A] 

TAGATAAAAATTTGTCTATTTAAGATGGTTAAAGATGTTCTTACCCAAGGAAAAGTAACA 
AATTATAGAATTTCCCAAAAGATGTTTTGATCCTACTAGTAGTATGCAGTGAAAATCTTT 
AGAACTAAATAATTTGGACAAGGCTTAATTTAGGCATTTCCCTCTTGACCTCCTAATGGA 
GAGGGATTGAAAGGGGAAGAGCCCACCAAATGCTGAGCTCACTGAAATATCTCTCCCTTA 
TGGCAATCCTAGCAGTATTAAAGAAAAAAGGAAACTATTTATTCCAAATGAGAGTATGAT 

7 7 557 AAAGGAACTGGGATTATTGAGCCTGGAGAAGAGAAGACTGAGGGGCAAACCATTGATGGT 
TTTCAAGTATATGAAGGGTTGGCACAGAGAGGGTGGCGACCAGCTGTTCTCCATATGCAC 
TAAGAATAGAACAAGAGGAAACTGGCTTAGACTAGAGTATAAGGGAGCATTTCTTGGCAG 
GGGCCATTGTTAGAATACTTCATAAAAAAAGAAGTGTGAAAATCTCAGTATCTCTCTCTC 
TTTCTAAAAAATTAGATAAAAATTTGTCTATTTAAGATGGTTAAAGATGTTCTTACCCAA 
[G,A] 

GAAAAGTAACAAATTATAGAATTTCCCAAAAGATGTTTTGATCCTACTAGTAGTATGCAG 
TGAAAATCTTTAGAACTAAATAATTTGGACAAGGCTTAATTTAGGCATTTCCCTCTTGAC 
CTCCTAATGGAGAGGGATTGAAAGGGGAAGAGCCCACCAAATGCTGAGCTCACTGAAATA 
TCTCTCCCTTATGGCAATCCTAGCAGTATTAAAGAAAAAAGGAAACTATTTATTCCAAAT 
GAGAGTATGATGGACAGATATTTTAGTATCTCAGTAATGTCCTAGTGTGGCGGTGGTTTT 
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